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PREFACE 


This  volume  contains  the  technical  proceedings  of  the  First 
International  Interactive  Workshop  on  Inversion  Methods  in 
Atmospheric  Remote  Sounding,  held  in  Williamsburg,  Virginia, 
December  15-17,  1976.  Seventy-three  invited  scientists  from  seven 
countries,  representing  universities,  research  laboratories,  and 
U.S.  Government  agencies,  participated  in  the  Workshop.  The  pur- 
pose of  the  Workshop  was  to  provide  an  interdisciplinary  forum  to 
review  and  assess  the  state  of  the  art  in  Inversion  Methods  avail- 
able for  retrieving  information  about  the  atmosphere  from  remotely 
sensed  data. 

Twenty-one  invited  papers  covered  the  mathematical  theory  of 
Inversion  Methods  as  well  as  the  application  of  these  methods  to 
the  remote  sounding  of  atmospheric  temperature,  relative  humidity, 
and  gaseous  and  aerosol  constituents.  The  emphasis  was  on  the 
assumptions , methodology , resolution , stability , accuracy , and 
future  efforts  needed  in  the  various  Inversion  Methods.  Also 
included  are  invited  papers  on  the  Direct  Radiative  Transfer 
Methods  and  results  relevant  to  the  Inversion  Problem.  The  latter 
were  presented  in  a special  session  on  Radiative  Transfer  Methods, 
held  jointly  with  the  Optical  Society  of  America  Topical  Meeting 
on  Atmospheric  Aerosols,  which  preceded  the  Workshop.  One  of  the 
major  Workshop  objectives  was  to  enable  researchers  in  different 
areas' of  atmospheric  remote  sounding  to  compare  and  optimize  the 
utilization  of  these  inversion  procedures  in  their  respective 
remote  sounding  techniques.  Ample  time  was  allowed  for  dis- 
cussions following  each  paper  and  in  two  open  discussion  sessions. 
This  fulfilled  an  important  objective  of  the  Interactive  Workshop. 
Discussions  presented  were  recorded  and  the  transcripts  post- 
edited.  Each  discussant  edited  his/her  portion  of  the  statements 
with  the  aim  of  improving  its  clarity  without  changing  its  sub- 
stance . 

Since  NASA  is  involved  in  developing  several  remote  sensing 
experiments  designed  to  monitor  the  atmospheric  constituents  and 
properties  from  aboard  space  platforms,  the  undersigned  suggested 
to  M.  P.  McCormick,  Langley  Research  Center,  that  organization  of 
an  Interactive  Workshop  dealing  with  the  mathematical  aspects  of 
Inversion  Methods  will  greatly  benefit  all  researchers  concerned 
with  inversion  and  radiative  transfer  methods.  He,  along  with 
J.  D.  Lawrence,  Jr.,  Langley  Research  Center,  and  M.  Tepper,  NASA 
Headquarters,  concurred  and  supported  the  idea  with  the  result 
that  the  undersigned  undertook  the  assignment  of  organizing  such 
a workshop  with  the  goal  of  making  the  proceedings  of  the  Workshop 
readily  available  to  the  scientific  community. 

To  ensure  proper  representation  of  major  disciplines  involved, 
a Workshop  Program  Committee,  composed  of  A.  Deepak  (Chairman) , 

Old  Dominion  University;  M.  P.  McCormick  (Associate  Chairman) , 
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NASA  Langley  Research  Center;  B.  M.  Herman,  University  of  Arizona; 
J.  D.  Lawrence,  Jr.,  NASA  Langley  Research  Center;  and  M.  Tepper, 
NASA  Headquarters,  was  set  up.  The  Committee  was  ably  assisted  in 
this  endeavor  by  the  following  Program  Consultants:  M.  T.  Chahine, 

Jet  Propulsion  Laboratory;  B.  J.  Conrath,  NASA  Goddard  Space 
Flight  Center;  A.  L.  Fymat,  Jet  Propulsion  Laboratory; 

J.  Russell,  III,  NASA  Langley  Research  Center;  and  E.  Westwater, 
NOAA/Environmental  Research  Laboratory. 

Dr.  Tepper  opened  the  Workshop  stressing  the  significance  of 
inversion  problems  to  NASA  and  its  programs  involved  in  the 
monitoring  of  atmospheric  environments  of  the  Earth  and  other 
planets.  The  challenges  inherent  in  the  inversion  problem  were 
perhaps  best  characterized  by  his  analogy  that  the  problem 
of  the  Inversion  Method  was  like  that  of  unscrambling  an  egg, 
wherein  one  investigates  the  scrambled  egg  to  determine  what  it 
was  like  originally. 

The  undersigned  wishes  to  acknowledge  the  enthusiastic 
support  and  cooperation  given  him  by  the  participants,  the  mem- 
bers of  the  organizing  committee,  the  program  consultants,  the 
session  chairmen,  and  the  speakers  in  making  the  Workshop  a very 
stimulating  and  valuable  experience  for  everyone.  Special  thanks 
are  due  M.  P.  McCormick  whose  wholehearted  cooperation  and  active 
support  as  Associate  Chairman  assured  the  success  of  the  Workshop. 
Commendations  are  due  the  Science  and  Technical  Information  Program 
Division  and  especially  the  Technical  Editing  Branch,  for  their 
cooperation  and  high  quality  of  workmanship  in  publishing  this 
Proceedings.  Last  but  not  least,  it  is  a pleasure  to  thank  and 
highly  commend  the  superb  job  done  by  Mrs.  M.  "Sue"  Crotts  both 
in  helping  with  the  organization  of  the  Workshop  and  with  the 
excellent  quality  of  typing  of  the  manuscript. 

Behind  every  successful  remote  sensing  technique  is  at  least 
one  reliable  Inversion  Method.  I hope  these  Proceedings  will  be 
a lasting  contribution  to  the  field  of  Inversion  Methods. 


Adarsh  Deepak 
Editor 
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HYBRID  METHODS  ARE  HELPFUL 


H.  C.  van  de  Hulst 
Leiden  University 


A basically  simple  problem  like  multiple  scattering  in 
a plane  layer  often  permits  the  convenient  use  of  different 
methods  joined  together.  Sample  numerical  results  to  illus- 
trate this  point  refer  to  X-  and  Y- functions , asymptotic 
fitting,  the  small-loss  approximations,  polarization  in 
high  orders,  and  photon  path  distribution. 

I . INTRODUCTION 

Methods  to  solve  problems  in  radiative  transfer  and  in  multiple 
scattering  exist  in  such  a wide  variety  that  I shall  not  attempt 
another  review.  In  any  practical  problem,  the  method  must  be 
chosen  on  the  basis  of  expediency,  and  this,  in  turn,  depends  on 
many  factors,  such  as:  range  of  variables;  desired  accuracy  of 

results;  occasional  or  frequent  computations  needed;  cost,  avail- 
able funds;  and  experience  and  taste.  I emphasize  in  this  paper 
the  fact  that  in  many  situations  a hybrid  approach  containing  ele- 
ments from  different  methods,  though  not  **elegant"/  is  the  most 
practical.  A noimial  rose  or  fruit  tree  consists  of  different 
varieties  skillfully  grown ‘ together  because  the  desired  properties 
of  roots  and  fruits  (or  flowers)  are  not  met  in  a single  variety. 

Before  illustrating  this  point  with  a number  of  examples  taken 
from  Ref.  1,  I wish  to  make  a general  remark.  We  all  have  learned 
to  respect  the  power  of  mathematics.  Solving  a problem  in  mathemat- 
ical physics  often  is  like  going  somewhere  by  train.  The  mathematics 
is  like  the  train:  we  enter  at  a station  called  equation  and  we  get 


Fig.  I.  A schematic  diagram  illustrating  the  advisability 
of  accessing  from  physical  concepts  to  intermediate  results^  whose 
interpretation  may  be  as  important  as  that  of  final  numbers. 


off  at  a station  called  solution  (see  Fig.  1) . Once  inside  the 
train,  we  can  relax  and  look  out  of  the  window,  for  nothing  much 
can  go  wrong.  In  contrast,  the  roads  from  our  home  to  the  station 
and  from  the  last  station  to  our  destination  may  be  time-consuming r 
uncomfortable,  or  even  hazardous.  My  point  is  that  in  many  problems 
the  train  is  suburban:  it  stops  at  certain  intermediate  stations. 
Boarding  the  train  at  one  of  those  stops  may  be  quite  as  safe, 
respectable,  and  economic  as  entering  at  the  initial  station  which 
in  our  topic  is  called  equation  of  transfer.  At  any  rate,  it  is 
worth  a try  to  find  out  which  is  simplest.  Likewise,  it  often 
pays  to  evaluate  and  physically  interpret  certain  intermediate 
results  which  is  the  same  as  getting  off  before  the  end  station. 

The  philosophical  message  is  that  combining  physics  with 
mathematics  makes  a hybrid  method  anyhow.  Therefore,  we  might  as 
well  experiment  a little  to  find  the  most  convenient  connection. 
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II.  THE  X-  AND  Y-FUNCTIONS 


Consider  any  given  landscape  (Fig.  2)  with  an  isotropic  light 
source  placed  at  a point  P and  no  other  source  of  illumination. 
Looking  at  this  landscape  from  a distance,  say  from  the  direction 
Q,  we  see  a blob  of  light  in  which  the  source  itself  (dimmed  or  not) 
may  still  be  discernible.  We  define  the  gain  (from  P to  Q and  con- 
versely) as  the  intensity  that  reaches  Q from  source  plus  illumi- 
nated landscape  divided  by  the  intensity  that  would  reach  Q from 
the  bare  source  placed  at  the  same  distance. 

This  definition  is  given  in  preparation  of  a discussion  of  the 
well-known  X-  and  Y-functions  for  isotropic  scattering.  These 
functions  were  introduced  by  Ambartsumian  in  the  early  forties  and 
extensively  studied  by  Chandrasekhar  about  1945,  who  defined  them 
as  solutions  of  certain  simultaneous  nonlinear  integral  equations. 


Fig.  2.  A schematic  sketch  of  a "landscape"  with  an  iso^ 
tropic  source  at  P for  illustrating  the  concept  of  gain  between 
point  P and  a direction  Q, 
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Far  simpler,  in  the  gain  definition  just  given,  we  may  take 
as  the  "landscape"  a homogeneous  slab  of  isotropically  scattering 
particles  with  optical  thickness  b,  the  source  P just  outside  the 
slab,  and  the  direction  Q subtending  an  angle  6 with  the  normal. 
The  X-function  then  is  the  gain  with  the  source  seen  in  front  of 
the  slab  and  the  Y-function  is  the  gain  with  the  source  (dimmed) 
seen  through  the  slab.  This  is  all  there  is  to  it:  no  problems 
of  existence  or  uniqueness  if  we  board  the  train  at  this  station. 
Both  functions  depend  on  three  variables:  b,  y = cos  0,  and  a = 
the  albedo  for  single  scattering. 

I was  quite  pleased  when  in  1947  I rediscovered  these  defi- 
nitions,  from  which  Ambartsumian  had  started,  and  found  that  cer- 
tain properties  of  these  functions  can  be  far  more  easily  derived 
from  these  physical  definitions.  Since  that  time,  I do  not  hesi- 
tate to  use  the  two  approaches  mixed. 

Figure  3 shows  a selection  of  values  of  these  functions.  The 
Y-function  usually  is  less  than  unity  because  the  blob  of  scat- 
tered and  multiply-scattered  light  does  not  fully  compensate  the 
dimming  of  the  direct  source.  The  X-f unction  always  is  greater 
than  unity  because  it  includes  a term  unity  arising  from  the 
unobstructed  light  from  the  source.  The  X-function  for  a semi- 
infinite atmosphere  (b  = °°)  usually  is  called  the  H-function.  If 
b = and  a = 1,  all  radiation  incident  on  the  atmosphere  is 
returned  as  diffusely  reflected  light  after  many  scattering  events. 
If  the  landscape  were  a mirror  reflecting  all  incident  radiation, 
we  would  see  the  source  double  from  any  direction,  which  would 
mean  X(y)  = 2.  The  diffuse  reflection  leads  to  the  same  average 
value  of  2 for  H(y),  but  the  distribution  with  y is  different, 
ranging  from  H(0)  = 1.0  to  H(l)  = 2.908. 

As  a counter  example,  in  which  the  physical  picture  is  of 
little  use,  I mention  the  extension  of  the  H-function  to  arguments 
outside  the  domain  y = 0 to  1.  Such  an  extension  is  needed  in  a 
variety  of  problems.  It  is  then  convenient  to  plot  the  inverse 
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Fig.  J.  Reminder  of  the  dependence  of  the  X-  and  Y-functions 
for  Isotropic  scattering  on  the  variables  hr  a,  and  y.  The  X- 
function  for  b = °°  is  called  H- function. 


function  {H(y)}  ^ against  the  inverse  argument  y \ Figure  4 gives 

> 

an  example  which  happens  to  refer  to  anisotropic  scattering  but 
this  makes  no  essential  difference.  We  see  that  the  graph  con- 
tinues to  curve  down  for  y ^ < 1 and  reaches  0 at  y -k,  where 
k is  the  diffusion  exponent,  i.e.,  the  value  for  which  ^self- 
consistent  solution  to  the  transfer  equation  in  an  unbounded  medium 
exists  in  which  the  dependence  on  optical  depth  t is  given  by  the 
factor  exp(±  kx) . The  values  and  slopes  at  y ^ = 0 and  y ^ = -k 
occur  in  several  standard  problems. 

We  have  taken  these  illustrations  from  isotropic  scattering 
and  one  example  from  very  simple  anisotropic  scattering.  Phase 
functions  of  arbitrary  form,  or  phase  matrices  with  polarization, 
require  a more  elaborate  set  of  formulas.  Yet,  the  situation 
remains  basically  the  same:  carefully  preparing  the  access  at 
intermediate  stations  usually  pays  off  in  clarity  or  in  speed  of 
computation . 

A final  remark  on  the  X-  and  Y-functions  for  single-scattering 
patterns  of  arbitrary  form  is  that  these  same  functions  appear  in 
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Fig.  4.  A plot  of  1/H(]x)  as  a function  of  i/y  depicting  the 

best  way  for  visualizing  the  behavior  of  the  H-function  beyond  the 

usual  domain  (7  ^ y ^ i for  linearly  anisotropic  scattering  with 

(0=0)=  0.9. 
o 1 


conceptually  quite  different  methods.  The  sketch  in  Fig.  5,  which 
we  shall  not  explain  in  detail,  shows  four  important  methods  of 
solving  radiation  transfer  problems.  In  the  second  method, 
labeled  "invariant  embedding,"  the  X-  and  Y-f unctions  are  intro- 
duced to  describe  the  effect  of  a narrow  layer  added  to  one  or  the 
other  side  of  a slab.  The  method  of  singular  eigenfunction  expan- 
sion works  from  an  entirely  different  concept,  in  which  the  com- 
plete set  of  eigenfunctions  for  the  unbounded  medium  is  first 
established.  The  proper  coefficients  of  each  that  match  the 
boundary  conditions  are  then  found  by  applying  orthogonality 
relations  and  it  is  in  the  course  of  establishing  the  half-range 
orthogonality  relations  that  the  X-  and  Y-functions  have  to  be 
introduced. 
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Fig.  5.  Schematic  representation  of  some  commonly  used 
methods  to  solve  problems  of  multiple  scattering  or  radiative 
transfer . 


Ill-  ASYMPTOTIC  FITTING 

Before  describing  this  method,  I wish  to  convey  by  means  of 
Fig.  6 an  impression  of  the  range  of  variables  in  which  the  sim- 
plest approximations  suffice.  I refer  to  the  legend  for  details. 

The  diagram  shows  that,  if  the  scattering  is  conservative,  or 
nearly  so,  a wide  range  of  four  decades  in  the  optical  thickness  b 
exists,  in  which  we  cannot  in  practice  say:  the  layer  is  very  thin, 
or  it  is  infinitely  thick.  This  is  annoying  because  the  conver- 
gence of  almost  any  method  is  small  for  large  b.  This  point  is 
illustrated  for  the  successive  scattering  method  in  Fig.  7. 

The  incidence  is  normal  in  this  example  and  the  scattering  is 
isotropic,  so  that  the  source  function  for  first-order  scattering 
Jj  is  1/4  e in  both  examples.  In  the  right-hand  side  example,  the 
layer  thickness  is  1.  At  each  successive  scattering  there  are 
losses  at  both  sides  and  the  net  effect  is  that  after  some  five 
scattering  events,  the  distribution  has  become  symmetric  and  drops 
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Fig,  6.  Representation  of  the  range  of  the  variables  b and 
a in  which  the  simplest  approximations  suffice.  The  very  simplest 
approximations  are  valid  with  an  acfcuracy  better  than  1%  in  the 
shaded  areas  in  this  diagram:  left  single  scattering , right  the 
equations  for  a semi-infinite  atmosphere.  The  next  approximations , 
shown  by  hatched  areas / are  left  single  plus  double  scattering  and 
right  the  thick-layer  asymptotic  formulae ^ again  to  a 1%  accuracy. 
The  corresponding  5%  limits  are  shown  by  dotted  curves.  Scales  are 
linear  in  log  b and  Vl  - a and  the  quantity  treated  is  the  plane 
albedo  for  normal  incidence  with  isotropic  scattering . 

with  every  further  scattering  by  a constant  factor  0,619-  The  con- 
vergence, then,  is  as  a geometric  series  with  this  ratio.  However, 
on  the  left-hand  side  example,  the  thickness  is  b 10,  Here,  the 
convergence  seems  rapid  near  T = 0,  but  is  not  at  all  visible  yet  at 
T > 3.  Eventually,  the  convergence  will  be  as  a geometric  series 

with  ratio  0.976  (for  b = 10),  or  0,993  (for  b = 20),  or  as  a 

-3/2 

series  with  terms  proportional  to  n (for  b = °°)  • In  any  case, 

this  convergence  is  too  slow  to  make  the  method  of  successive 
scattering  attractive. 
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Fig,  7.  Demonstration  of  the  convergence  of  the  method  of 
successive  scattering  by  plots  of  the  source  function  against 
optical  depth  t in  successive  orders.  The  example  refefs  to  per- 
pendicular incidence  on  a layer  with  isotropic  scattering.  Left: 
total  depth  ^ 10 r slow  convergence.  Right:  total  depth  1,  rapid 
convergence , 


In  this  domain,  we  have  elaborately  used  the  doubling  method. 

It  is  clear  that  by  th^  doubling  method  we  can  in  ten  steps  bridge 
the  range  of  optical  thickness  b = 1/32,  1/16,  1/8,  ...,  8,  16,  32, 
but  that  it  is  silly  to  try  to  carry  this  process  to  b = <»,  which 
we  can  never  reach.  Instead,  we  wish  to  use  the  known  asymptotic 
forms  of  the  reflection  and  transmission  functions  for  sufficiently 
large  b.  The  grafting  of  the  different  approach  (asymptotic  theory) 
onto  the  numerical  computation  (doubling)  makes  a hybrid  compu- 
tational method  and  we  have  called  this  method  asymptotic  fitting. 

It  uses  the  doubling  results  in  exactly  the  same  way  as  we  would 
use  measured  data  to  find  the  constants  in  an  equation  of  known 
form. 
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The  approach  to  such  a known  form,  prescribed  by  asymptotic 
theory  is  well  illustrated  by  Fig.  8.  Again  the  example  is  simple 
namely  isotropic  scattering,  and  the  ordinate  is  the  source  func- 
tion. It  is  seen  that  not  quite  at  b ==  4,  but  certainly  at  b = 8 
and  higher  there  is  a range  of  t in  which  the  graph  is  straight. 

In  this  "diffusion  domain"  the  source  function  goes  as  exp(-kT)  as 
in  an  unbounded  medium.  The  diffusion  domain  is  the  range  of  t 
far  enough  from  the  top  side,  where  injection  takes  place 
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Fig,  8,  The  source  function  is  plotted  against  optical  depth 
in  six  situations  which  differ  only  by  the  assumed  thickness  b of 
the  layer  and  in  which  the  single  scattering  albedo  a = 0,90  and 
the  cosine  of  the  angle  of  incidence  are  kept  constant.  From 
b = 8 up  the  curves  show  a straight  middle  portion  (diffusion 
domain)  and  the  asymptotic  laws  are  valid. 


(t  = 0 to  3) / and  from  the  bottom  side,  where  the  escape  of  radi- 
ation becomes  noticeable  (t  = b - 2 to  b) • Using  this  knowledge 
and  taking  the  results  of  three  successive  steps,  say  b = 8,  16 
and  32,  yields  all  desired  functions  and  constants  for  a semi- 
infinite medium - 

IV.  THE  CORNER  OF  SMALL  LOSSES 

In  the  problem  of  reflection  and  transmission  by  a homogeneous 
slab  of  scattering  particles,  I like  to  display  the  results  in  a 
diagram  of  a,  the  albedo  for  single  scattering,  against  b,  the 
optical  thickness  of  the  slab.  Such  a representation  is  chosen  in 
Fig.  9.  This  figure  may  be  regarded  as  a companion  to  Fig.  6, 
because  it  shows  in  a different  way  (roughly  delineated  by  the 
dotted  curves)  in  which  area  we  may  say  that  the  result  is  as  for 
b = °°  and  in  which  area  we  may  say  that  the  result  is  as  for  a = 1. 


b 


Fig.  9.  The  full  domain  of  albedo  a and  layer  thickness 
b Is  displayed  on  linear  scales  of  /l  - a and  (1  + b)^^ . The 
curves  show  the  values  of  the  reflected  flux  portion  (plane 
albedo)  for  normal  Incidence  with  Isotropic  scattering. 
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At  one  corner/  defined  by  a = 1/  b = <»/  the  reflection  is  com- 
plete. Small  losses  by  escape  from  the  bottom  of  the  slab  occur  if 
b is  not  quite  Small  losses  by  absorption  in  the  atmosphere 
occur  if  a < 1.  These  losses  are  not  additive  and  their  combined 
behavior  has  posed  nasty  problems  in  many  papers,  numerical  and 
analytical  alike.  Yet,  the  situation  is  simple  if  we  refer  to 
asymptotic  theory.  Both  of  these  losses  have  the  same  dependence  on 
angle  of  incidence  and  escape  because  they  both  occur  in  deep  layers. 
They  add  in  a way  which  is  universal,  except  for  scale  factors. 

This  is  shown  in  Fig.  10  which  is  an  enlarged  portion  of  the  top 
right-hand  corner  of  Fig.  9,  The  dotted  curves,  which  have  known 
tangents  in  the  corner  domain,  now  indicate  the  exact  loci  where 
the  losses  by  escape  from  the  bottom  (i.e.,  into  the  black  ground) 
and  the  losses  by  imperfect  scattering  in  the  atmosphere  have  a 
fixed  ratio. 

V.  POLARIZATION  IN  HIGH-ORDER  SCATTERING 

The  fine  visual  observations  of  polarization  of  the  planet 
Venus  at  various  phase  angles  published  in  Lyot's  thesis  in  1929 
have  outlasted  40  years  before  the  first  tentative  interpretation 
could  be  replaced  by  a more  definite  one.  Lyot  conjectured  that 
polarization  might  be  completely  absent  in  all  but  first-order 
scattering.  This  guess  was  not  correct-  Since  1970,  the  accuracy 
of  both  the  measurements  and  the  calculations  have  been  greatly  ’ 
improved.  Most  authors  now  agree  that  the  polarization  curves  of 
Venus  measured  at  different  wavelengths  present  convincing  evidence 
of  concentrated  sulfuric  acid  as  the  main  constituent  of  the  drop- 
lets. 

Let  us  return  to  the  basic  problem:  how  much  polarization  can 
be  present  in  the  light  arising  from  double  and  multiple  scattering? 

I have  worked  out  an  extreme  practicing  example  based  on  Rayleigh 
scattering.  Radiation  is  assumed  to  fall  on  a semi-infinite  atmos- 
phere under  60^  from  the  normal  and  we  seek  to  compute  the  intensity 
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Fig.  10.  Enlarged  portion  of  a corner  of  Fig.  9.  Heavy 
curves  give  the  combined  loss  = nonre fleeted  fraction  of  Incident 
flux  and  dotted  curves  show  the  ratio  of  the  two  kinds  of  loss. 

The  curves  near  the  loss-less  corner  have  a universal  form  Inde- 
pendent of  the  phase  function. 

and  polarization  reflected  back  to  the  source  (as  for  part  of  a 
planet  in  opposition) • The  degree  of  polarization  in  this  example 
is  0%  in  the  first  order,  9%  in  the  second  order,  and  surprisingly 
goes  up  even  further  to  13%  and  14%  in  the  third  and  fourth  orders 
before  it  settles  to  2.2%  in  very  high  orders. 


If  now  we  add  a factor  to  the  nth  order  and  take  the  sum, 
we  obtain  the  intensity  diffusely  reflected  back  in  each  polariza- 
tion with  a = single,  scattering  albedo  (Fig.  11) . The  effect  just 
mentioned  then  shows  up  as  a maximum  polarization  of  6.1%  for 
a = 0.95.  The  simplest  way  to  collect  these  results  was  to  com- 
bine data  obtained  by  very  different  methods. 


VI.  PHOTON  PATH  DISTRIBUTIONS 


The  understanding  of  many  problems,  for  instance,  the  for- 
mation of  planetary  absorption  lines,  is  greatly  aided  by  a clear 
knowledge  of  the  photon  path  distributions.  It  is  well  known  that 
such  distributions  may  be  obtained  by  an  inverse  Laplace  transform 
from  the  dependence  of  the  reflection  function  on  a (albedo  for 
single  scattering) . This  knowledge  may  be  put  to  use  in  various 
ways:  the  inverse  Laplace  transform  may  be  applied  to  any  form  in 
which  the  reflection  function  is  known — analytic,  numerical,  or 
asymptotic.  A systematic  exploration  of  this  possibility  has  made 
it  possible  to  combine  smoothly  the  results  for  very  low  orders, 
where  ad  hoc  calculations  are  fast,  with  those  of  high  orders, 
where  an  asymptotic  approach  is  more  appropriate. 

I shall  show  two  examples,  both  referring  to  finite  layers 

with  isotropic  scattering.  For  very  low  orders,  it  is  possible 

to  derive  the  explicit  form  of  the  photon  path  distribution.  This 

is  done  for  n = 1 and  n = 2 in  Fig.  12  (with  b=l,  y=y^=l)  on 

the  basis  of  computations  made  by  Irvine.  It  is  much  easier  to 

2 

derxve  only  the  average  path  <\  > and  the  variation  a from  the 

n n 

average  path.  A smooth  distribution  curve  with  a convenient  form 
and  with  the  correct  average  and  variation  then  is 


P„(A) 


m 

r(k) 


e 


-mA 


where  m and  k must  be  solved  from 
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Fig,  11,  Construction  of  the  curve  o. 
gle  scattering  albedo  a for  reflection  . 
d direction  against  a semi -infinite  Ray. 
illuminated  under  60^  with  the  normal 
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Fig,  12,  Average  photon  path  length 
'lecting  against  or  transmission  by  a fi 
'ing  layer  with  optical  thickness  b. 


k = m<X  > = m^a  ^ 
n n 

and  r(k)  is  the  gamma  function. 

I have  plotted,  in  Fig.  12,  as  dotted  curves  these  distribu- 
tions for  n = 2 and  n = 5.  The  fact  that  the  dotted  curve  for 
n = 2 comes  already  close  to  the  exact  curve  means  that  at  higher 
n for  most  practical  problems  we  do  not  have  to  worry  about  the 
exact  curves  but  can  use  the  approximation  with  full  confidence. 

The  larger  the  total  number  of  scattering  events,  the  larger 
the  combined  optical  path.  In  an  unbounded  medium,  we  simply  have 
an  average  path  length  per  scattering  event  equal  to  1.  In  order 
to  show  the  transition  from  very  thin  to  very  thick  layers,  I have 
therefore  plotted  in  Fig.  13  the  average  path  length  divided  by  n. 
Again,  exact  results  for  n = 1 and  n = 2 have  been  combined  with 
exact  asymptotic  results  (value  and  slope)  near  n = “.  Note  that 
toward  large  values  of  n,  the  photon  again  "forgets”  from  which 
side  it  came  so  that  the  curves  for  transmission  and  reflection 
converge  toward  the  same  value  and  become  tangent.  For  very  thin 
layers  (b  ^ 0.1) , it  is  immaterial  from  the  start  whether  we  con- 
sider reflection  or  transmission. 
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Figr.  13.  Path-length  distribution  of  radiation  diffusely 
reflected  back  in  vertical  direction  after  n scattering  events 
from  a slab  of  isotropically  scattering  particles  of  optical  thick- 
ness 1 exposed  to  vertical  illumination.  The  exact  forms^  known 
for  small  n,  rapidly  approach  an  asymptotic  theory, 

SYMBOLS 

a single  scattering  albedo 

b optical  thicknes.s 

H X-function  for  a semi-infinite  atmosphere  (b  = <»  ) 

J source  function  for  first-order  scattering 

k ± 1/y 

photon  path  distribution  function 

R R 

Z'  r intensity  for  the  polarized  components  (subscript  Z is  for 
parallel  component;  and  r is  for  perpendicular  component) 
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l/(b  + a) 

X-function  for  isotropic  scattering 

Y-function  for  isotropic  scattering 

gamma  function 

zenith  angle 

wavelength 

average  path 

cos  0 

variation  from  the  average  path 
optical  depth 
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DISCUSSIONS 


Green:  I wondered  if  your  slide  that  you  did  not  talk  about  had 

something  to  do  with  multiple  scattering  and  path  lengths? 

van  de  Hulst:  Yes,  this  slide^  shows  the  average  path  length  per 

scattering.  It  is  simplest  to  think  that  you  have  done  a Monte 
Carlo  calculation  and  separate  the  paths  out  by  the  number  of 
scatterings.  Those  which  have  two  successive  scatterings,  you 
divide  by  two.  And  those  which  have  five  successive  scatterings 
you  divide  the  total  path  length  by  five.  Then  that  comes  at 
infinity  to  a certain  value  which  is  not  too  difficult  to  calculate. 
These  have  been  combined  in  the  graph  with  definite  values  shown 
as  dots.  When  the  thickness  is  not  very  large,  in  transmission 
and  in  reflection,  you  get  approximately  the  same  path  lengths. 

And  also,  if  you  go  to  the  very  large  n,  the  light  doesn't  know 
anymore  from  which  side  it  came,  so  the  transmission  and  reflec- 
tion curves  again  coincide. 


See  Fig.  13 


in  this  paper. 
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REVIEW  OF  RADIATIVE  TRANSFER  METHODS  IN 


SCATTERING  ATMOSPHERES 


Jacqueline  Lenoble 

Universite  des  Sciences  et  Techniques  de  Lille 


The  problem  of  radiative  transfer  in  a scattering 
plane-parallel  atmosphere  is  discussed , considering  the 
exact  analytical r the  computational  and  the  approximate 
methods.  Some  results  of  numerical  comparisons  are  given. 
Finally^  the  difficulties  of  realistic  atmospheric  models 
are  emphasi zed . 


I.  INTRODUCTION 

The  problem  of  radiative  transfer  in  scattering  atmospheres 
can  roughly  be  divided  into  two  cases:  (1)  when  the  plane- 

parallel  approximation  is  acceptable  a great  deal  of  methods  are 
now  available,  and  it  was  worth  trying  a comparative  review  of 
the  methods  and  numerical  comparisons  between  them;  and  (2)  when 
the  atmosphere  cannot  be  assumed  to  be  plane-parallel  (twilight, 
finite  clouds)/  the  problem  reaches  a much  higher  degree  of  com- 
plexity and  there  is  no  more  a question  of  choosing  between 
existing  methods,  but  of  seeking  the  development  of  new  methods. 

II.  EQUATION  OF  TRANSFER  (Ref.  1) 

The  general  equation  of  transfer  has  the  matrix  form 

0)  (r) 

Q.VI  (r,Q)  = K(r)  {-  I(r,^2)  + — //  P(r;^2,f2')  I(r,fi*)  dai' 

+ (1  - w^(r))]g(r)}  fl) 
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where  is  the  radiance  matrix  at  a point  (r)  in  the  direc- 

->■ 

tion  (U)  . is  the  phase  matrix;K(r),  the  total  extinc- 

— 

tion  coefficient  (absorption  + scattering);  and  co^(r),  the  single 
scattering  albedo.  Jg(r)  is  the  source  function  due  to  thermal 
emission;  it  will  be  neglected  in  this  paper. 


The  radiance  I(r,^^)  is  governed  by  equation 
f2.Vl(r,f2)  = K(r) 


IQ  lU 

+ v(?,n')}  doj'  I 


(2) 


and  depends  on  the  three  other  Stokes'  parameters  Q/U,V. 

Neglecting  the  polarization,  the  approximate  equation 


— 

tA)  (r)  r 

f2,VI(r,f2)  = K(r)  \ 


p(r;n,fi')  L,(r ,$!' ) du' 


can  be  used;  p{r,Q,^*)  is  the  phase  function. 


(3) 


Table  1 shows  an  example  of  the  error  done  in  this  approxi- 
mation; it  can  be  of  the  order  of  10%  for  molecules  and  very  small 
particles,  but  becomes  negligible  for  particles  with  a Mie  param- 
eter a larger  than  4 or  5 . 


III.  HOMOGENEOUS  PLANE — PARALLEL  ATMOSPHERE 
A-  Generalities 

The  simplest  case  is  an  homogeneous  atmosphere  limited  by  two 

infinite  parallel  planes  and  illuminated  on  its  upper  boundary  by 

the  solar  beam.  The  only  position  variable  is  z or  better  the 

optical  depth  x = / K(z)dz,  where  Z is  the  altitude  of  the  upper 

^ .2 

boundary;  the  total  optical  thickness  is  x^  = K(z)dz.  The 
direction  ^ is  characterized  by  the  zenith  angle  0 = Arc  cos  p and 
the  azimuth  angle  c|). 
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TABLE  1 

Error  in  Percent  When  Neglecting  Polarization 
in  the  Computation  of  I 


0) 

o 

Rayleigh 

a = 1 

a = 2' 

a = 5 

0.99 

5.29 

5.37 

1.19 

0.07 

0.9 

8.40 

9.40 

3.77 

0.27 

0.8 

9.17 

10.86 

5.91 

0.46 

0.6 

8.19 

10.51 

8.25 

0.58 

0.4 

5.89 

8.04 

7.59 

0.48 

0.2 

3.06 

4.40 

4.13 

0.24 

The  equation  of  transfer  is  in  this  case 


„ MM., 


•2it 

0 


f+1 


p(yr<|);y'  r<P') 


-1 


0) 


T/y. 


X I (t;m’ ,(|)' ) dy'  d<()'  + -^  p(y,<|);y^,(f>^)F  e ° (4) 

where  ttF  is  the  sun  ir radiance  at  the  upper  boundary  on  a plane 
perpendicular  to  the  solar  direction  • Here,  I refers  to 

the  radiance  of  the  diffuse  flux  excluding  the  direct  solar  beam. 
The  boundary  conditions  are 

I (0  ; y < 0 / <j))  = 0 f ^ (5) 

and  given  by  ground  reflection. 

Integrating  Eq.  (4)  with  Eq.  (5)  we  get  the  integral  form  of 
the  equation  of  transfer 


Ti  -T 


rT, 


+ - 
U 


t-T 


J (t;y,(())e 


dt 


(y  > Q i) 


(6a) 
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I = I 


T/y  1 


rT 


t-T 

y 


J(t;y,(j))e  dt 


( y <0  ^ ) 


(6b) 


with 


J(T;p,<i>)  = — 


oj  ^2u  /•+! 
o 


p(y /<}>;y  ' ) I(T;y ',(()')  dy*  d^' 


-1 


“o 

+ P(Pf  F e 


= source  function 


(7) 


The  transmission  (S)  and  reflection  (T)  functions  are  defined 


by 


i'*’(o  ;+y /((>) 

I (T2;-y/(|))  = 4~ (8b) 

They  are  useful  when  the  main  interest  is  in  the  outgoing  (dif- 
fusely transmitted  or  reflected)  radiation. 

We  have  seen  that  the  equation  of  transfer  contains  the  phase 
function  p(9)  as  kernel-  This  phase  function  is  either  given  by 
a table  of  nximerical  experimental  values  or  by  a mathematical 
expression.  Between  the  different  possible  forms,  Eq.  (9)  gives 
the  expansion  in  Legendre  polynomials: 

P'**’  • ^ -0®<l  '5' 

its  main  advantage  is  to  allow  an  expansion  in  azimuth  of  the 
radiance  (and  of  all  the  radiation  parameters) : 

I(t;p,(())  = - (2  - 6 ) I®(t;p)  cos[s(4i  - <}.  ) ] (10) 

S “ L/  OS  O 

using  Eq.  (10)  with 
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(11) 


p(y,());y'  ,(fi' ) 


(2  - cos[s((})  -<(.')] 


the  equation  of  transfer  with  three  variables  can  be  split  into  a 
system  of  equations  with  only  two  variables 


o 
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e,  p*(u) 


s o 


T/y, 


F e 


0) 

o 
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f+1 


-1 


= s I®(T;y')  dy' 


s — 0/  1 • • • f L (12) 

I will  give  here  a brief  description  of  the  methods  for  the 
plane  parallel  case;  it  follows  a report  prepared  by  a Working 
Group  of  the  Radiation  Commission  (Ref.  2). 


B.  Exact  Analytical  Methods 

By  exact  analytical  methods/  we  understand  the  methods 
leading  to  a solution  for  the  radiance  in  terms  of  mathematical 
functions,  which  have  finally  to  be  tabulated;  therefore,  the 
accuracy  of  these  methods  may  not  be  better  than  the  accuracy  of 
more  direct  numerical  methods.  Their  main  interest  is  in  the 
understanding  of  the  mathematical  structure  and  of  the  general 
behavior  of  the  solutions.  Their  basic  drawback  is  the  difficulty 
to  use  these  methods  in  the  case  of  a real  atmosphere. 

Among  these  methods  we  will  classify  the  Singular 
Eigenfunctions  (or  Case)  method,  the  Wiener-Hopf  method,  and  the 
reduction  to  H-  or  X-  and  Y-  functions  which  can  be  founded  on 
the  principles  of  invariance.  This  reduction  is  straightforward 
only  for  very  simple  phase  functions,  such  as  isotropic  or 
Rayleigh  scattering.  In  these  cases,  many  accurate  tables  of  the 
H-  and  X-,  Y-functions  have  been  computed  and  this  makes  the 
method  a reference  for  testing  other  methods  in  the  simplest  cases. 
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C.  Computational  Methods 

By  computational  methods , we  understaind  methods  specifically 
designed  for  computers,  but  they  may  include  some  analytical 
treatment  before  the  numerical  procedure,  as  is  the  case  for 
the  spherical  harmonics-  The  spherical  harmonics  solution  is 
based  on  a discrete  spectrum  of  eigenvalues  v and  can  be  seen  as 
an  approximation  to  the  exact  Case  method  which  uses  a continuous 
spectrum-  The  discrete  ordinates  method  is  indeed  very  similar 
to  the  Spherical  Harmonics.  In  the  Monte  Carlo  method,  one  photon 
at  a time  is  followed  on  its  path  through  the  atmosphere  and  each 
event  is  defined  by  a probability  distribution.  The  Dart  method 
uses  a discretization  in  radiation  streams  whose  arrangement  is 
based  on  a regular  dodecahedron. 

The  successive  scattering  is  based  on  an  iteration  starting 
from  the  primary  scattering.  The  Gauss  Seidel  method  uses 
another  possible  scheme  of  iteration  with  about  the^  same  advan- 
tages and  drawbacks.  The  matrix  operator  is  based  on  the  inter- 
action principles:  the  reflection  and  transmission  of  the  layer 

are  expressed  in  terms  of  those  of  the  two  layers 
and  (Tj/T^)-  The  adding  method  uses  the  same  principles  with  a 
different  algorithm  and  it  reduces  to  doubling,  which  is  much 
faster, for  an  homogeneous  atmosphere.  On  the  other  hand,  the 
invariant  imbedding  uses  the  addition  of  infinitely  thin  layers 
to  obtain  differential  equations  for  the  reflection  and  trans- 
mission functions. 

Finally,  in  the  case  of  very  thick  layers,  asymptotic 
relations  can  be  derived;  the  scattering  function  of  a layer  of 
optical  thickness  is  expressed  in  terms  of  the  same  function 
for  a semi-infinite  layer;  the  correcting  term  contains  the  solu- 
tion of  the  Milne  problem  for  the  same  atmosphere  and  decreases 
-2vx,  . , . 

as  e ^ , where  v is  the  inverse  of  the  higher  eigenvalues  which 
appears  in  the  Case  and  in  the  Spherical  Harmonics  methods; 
relations  exist  for  the  transmitted  and  the  internal  radiance. 
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Table  2 shows  an  example  of  numerical  comparisons^  between 

the  Spherical  Harmonics,  the  Matrix  Operator,  and  the  Successive 

Scattering  methods.  It  gives  the  intensity  at  some  depths  t and 

for  some  directions  for  the  case  of  a haze  layer  with 

=1,  io  = 0.9  and  normal  incidence  (y  = - 1)  . The  relative 
o o 

difference  A between  Spherical  Harmonics  and  Matrix  Operator 
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All  the  numerical  results  presented  in  this  paper  have  been 
computed  in  the  framework  of  a comparison  program  sponsored  by  the 
Radiation  Commission  and  the  author  is  greatly  indebted  to  all 
contributors . 
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values  is  always  smaller  than  two-tenths  of  a percent.  In  the 
Successive  Scattering  method,  the  accuracy  can  be  increased  by 
increasing  the  nximber  of  iterations  and  the  computation  whose 
results  are  given  here  has  been  stopped  in  order  to  achieve  an 
accuracy  of  about  1%;  it  is  even  better  than  that. 

The  computation  time  is  shorter  with  the  Spherical  Harmonics 
method  than  with  the  Matrix  Operator  method,  except  maybe  when  a 
very  large  number  of  solar  directions  are  wanted  at  the  same  time. 
For  the  Successive  Scattering  method,  the  computation  time  is 
quite  competitive  in  this  case,  but  it  increases  very  fast  when 
0)^  tends  to  1 and  when  the  optical  thickness  increases. 

Table  3 shows  the  accuracy  obtained  by  Monte  Carlo  and  Dart 
methods  for  the  same  case  of  x = 0.2  and  for  various  values  of  u. 
The  values  plotted  are  the  relative  differences  A with  the  "exact” 
values  obtained  by  both  the  Spherical  Harmonics  and  the  Matrix 
Operator  methods.  The  Monte  Carlo  program  has  been  run  by  two 
groups  of  authors;  in  both  cases,  the  accuracy  is  about  a few 
percent,  sometimes  better  than  1%.  For  the  Dart  method  the  error 
is  a little  larger,  but  remains  always  smaller  than  10%.  The 
time  is  much  larger  for  both  these  methods  than  for  the  semi- 
analytical  methods;  but  their  main  interest  is  in  their  ability 
in  handling  non-plane-parallel  cases  as  we  will  see  later. 

Table  4 shows  again  a comparison  of  the  intensity  computed 
by  various  methods,  now  in  the  case  of  a thick  conservative  cloud 
(t^  = 64;  (0^  = 1;  = - 1) . Many  methods  are  unable  to  treat 

such  a large  optical  thickness  without  a prohibitive  computation 
time. 

In  the  Spherical  Harmonics  method,  the  computation  time  is 
nearly  independent  of  the  optical  thickness.  But  the  difficulty 
here  is  related  to  the  forward  peak;  Dr.  Devaux  has  approximated 
the  phase  function  of  the  cloud  by  the  sum  of  a Delta-function 
and  an  expansion  with  36  Legendre  polynomials.  Therefore,  the 
result  is  wrong  in  the  forward  direction  (y  = - 1)  for  small 
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TABLE  3 
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optical  depths;  it  might  easily  be  checked  that  the  values  found 
at  y = -1  for  x =6.4  are  smaller  than  the  values  expected  for 
primary  scattering  only.  Elsewhere,  it  is  expected  that  the 
truncature  procedure  gives  an  accuracy  better  than  1% . 

The  Monte  Carlo  method  has  no  problem  with  the  forward  peak 
and  the  results  at  y = -1  and  small  optical  thickness  are  better 
than  those  of  the  Spherical  Harmonics.  But  the  computation  time 
becomes  very  large  for  such  large  optical  thickness  and  only  one 
group  of  authors  has  run  the  Monte  Carlo  program  for  the  cloud 
case.  Except  for  y = - 1,  the  results  can  be  compared  with  those 
of  the  Spherical  Harmonics;  the  accuracy  seems  of  the  same  order  as 
that  for  Haze. 

Finally,  the  Asymptotic  method  finds,  in  the  case  of  large 
optical  thickness,  its  own  field  of  application.  Of  course,  it 
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does 

not  apply  at  small  optical  depth 

for  downward  radiation. 

But  elsewhere,  its  agreement  with  the  Spherical  Harmonics  is 


better  than  5%. 


D.  Approximate  Methods 

By  approximate  methods,  we  understand  methods  which  include 
a very  rough  approximation  of  the  atmospheric  properties  or  (and) 
of  the  transfer  problem.  They  generally  give  only  the  flux  and 
not  the  intensity,  but  the  computation  time  is  reduced  by  a factor 
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larger  than  100  or  even  the  use  of  a computer  can  be  avoided. 

Moreover  they  give  simple  analytical  expressions  of  the  flux. 

Between  them  we  have  classified  the  similarity  relations  which 

reduce  the  anisotropic  problem  to  an  isotropic  one,  the  Eddington, 

Two-Stream,  and  Exponential  Kernel  methods.  Various  modified 

versions  of  Two-Stream  and  Eddington  have  been  tried  with  success - 

2 

Recently,  Meador  and  Weaver  have  proposed  a general  theoretical 
framework  to  compare  these  methods.  Figure  1 shows  the  net  flux 
versus  t in  the  case  of  a haze  layer  in  normal  incidence  = - 1) 
and  for  = 0.9.  The  solid  line  corresponds  to  the  "exact**  values 
obtained  by  various  methods  within  an  agreement  of  a few  tenth  of 
a percent;  the  points  correspond  to  various  approximate  methods. 

The  worst  results  are  obtained  by  the  Eddington  and  Standard 
Two-Stream  methods;  the  accuracy  varies  from  a few  to  10%.  The 
best  results  are  given  by  the  Delta-Eddington,  the  Double  Delta- 
Eddington,and  the  Exponential  Kernel  methods,  with  an  accuracy  of 
about  1%.  Intermediate  performances  are  achieved  by  other  modifi- 
cations of  Eddington  or  Two-Stream  methods. 

The  results  remain  about  the  same  when  C tends  to  1,  but 

o 

'most  of  the  methods  are  a little  less  accurate  in  oblique  than 
in  normal  incidence,  except  the  Eddington  method  which  becomes 
better. 

If  we  consider  now  increasing  the  optical  thickness,  we  may 
say  that  a thick  conservative  cloud  is  the  worst  case  for  the 
Standard  Two-Stream  method  (accuracy  ^ 24%)  and  the  best  for 
Eddington  method  (accuracy  1%)  . But  again,  always  the 
Exponential  Kernel  and  the  Delta-  and  Double  Delta-Eddington 
achieve  an  accuracy  of  about  1%. 


"Personal  communication  of  their  work  is  acknowledged  by  the 


author. 
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IV.  REALISTIC  ATMOSPHERES 


A.  Inhomogeneous  Atmospheres 

It  must  be  first  noted  that  the  case  we  have  called 
"homogeneous"  is  actually  the  case  where  only  the  extinction 
coefficient  K is  function  of  the  altitude  z (see  definition  of  the 
optical  depth  x) , with  the  single  scattering  albedo  and  the  phase 
function  constant  throughout  the  atmosphere. 

A vertically  inhomogeneous  atmosphere  occurs  when  mixing 

ratio  of  scatterers  and  absorbers  varies  with  height  (ai  function 

o 

of  x) , or  when  the  type  of  scattering  particles  varies  with  height 
(phase  function  varying  with  x) , 

The  vertical  inhomogeneity  can  be  handled  by  the  equation  of 
transfer.  Eg.  (4),  including  the  variable  x in  or/ and  in 

p(x;]i,(j>;iJ*(j)*).  Most  of  the  methods  reviewed  for  the  homogeneous 
case  can  be  applied,  perhaps,  with  slight  modifications.  At  the 
limit,  the  vertically  inhomogeneous  atmosphere  can  always  be 
approximated  by  superposition  of  thin  homogeneous  layers.  Anyway, 
it  must  be  noted  that  the  computation  time  may  increase  rapidly 
with  the  inhomogeneity  (for  example,  doiibling  is  replaced  by  adding) 
and  that  in ■ some  methods  numerical  difficulties  or  instabilities 
appear  (Spherical  Harmonics  or  Discrete  Ordinates) . 


The  horizontal  inhomogeneity  is  much  more  difficult  to  handle, 
as  it  can  be  imagined  from  the  equation  of  transfer 


9 1/2 

(1  - y^)  cos  <() 


31  (x,y,z;y  ,t|)) 
3x 


+ (1  - y2) 


1/2 


sin  (j) 


31  (x,y,z;y  ,(j))  3I  (x,y,z;y  ,(|)) 

3y  3z 

- K(x,y,z)  {I  (x,y,z;y  ,(j>)  - J(x,y,z;y  ,(|))  } (13) 

where  the  three  coordinates  x,  y,  z and  the  three  corresponding 
partial  derivatives  appear.  This  case  comprises  the  very  important 
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problem  of  finite  clouds.  Here,  the  flexibility  of  the  Monte  Carlo 
Method  (Refs.  3,  4,  and  5)  finds  all  its  advantages,  and  although 
a few  attempts  have  been  done  to  develop  other  methods,  for 
example,  invariant  imbedding  (Ref.  6)  or  statistical  methods 
(Ref.  7) , the  only  results  obtained  for  finite  clouds  until  now 
have  been  obtained  by  Monte  Carlo. 


B.  Spherical  Atmospheres 


For  some  problems,  we  have  to  consider  the  sphericity  of  the 
real  atmosphere.  This  is  particularly  important  in  remote 
sensing  of  the  atmosphere  from  twilight  measurements  or  from  limb 
scanning  by  satellites. 

Using  spherical  coordinates  with  reference  to  the  local 
vertical,  the  transfer  equation  has  the  form 


(1  - _9_ 

R + z 9y 


, 1/2  „ 1/2 
(1  - p2)  (1  _ 

R + z 


COS((|)  - (J)^) 
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+ Sin((|>  - ((>  ) -r-r- -—5-  } I(z,y  ,())  ;y,(())  = 

^ _ ^2  o 3((j)  - (j)^)  00 


- K(z)  {l  (z,y^,(j)^;y  ,(())  - J (z , y^ , y , (}))  } 


(14) 


where  R is  the  earth  radius.  Here  again,  the  Monte  Carlo  method 
(Ref.  8)  can  be  applied  and  the  Dart  method  (Ref.  9)  has  been 
especially  studied  for  this  case;  but  the  research  in  approximate 
analytical  methods  seems  premising  (Ref.  10). 


C.  Scattering  with  Gaseous  Absorption 

All  what  we  have  said  here  concerns  monochromatic  radiation. 
A last  problem  appears  when  we  have  a radiative  transfer  problem 
in  a scattering  atmosphere  with  gaseous  absorption.  Then,  the 
number  of  monochromatic  problems  to  be  treated  becomes  prohibi- 
tive, even  for  a single  absorption  line.  Therefore,  approximate 
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methods  have  been  sought.  One  of  them  consists  in  introducing  the 
photon  path  distribution  P(X)  (Refs-  11  and  12)  which  is  defined  by 


I(B^)  - I(S^) 


,oo 


P(X) 

•^0 


exp  { 


Xk 

V 

a + k 

c 


}dX 


(15) 


where  the  subscript  V refers  to  the  frequency  v and  the  subscript 
c to  the  continuum  outside  the  line;  P(X)  can  be  obtained  either 
directly  from  a Monte  Carlo  calculation  or  by  the  inverse  Laplace 
transform 


P(X) 


r 1(03^)  ^ 


r = k / (a  + k ) 
V c 


(16) 


When  P(X)  is  once  for  all  obtained,  the  intensity  at  any  fre- 
quency of  the  spectrum  can  be  obtained  from  Eq.  (15) . Moreover, 
it  is  possible  for  some  problems  to  define  various  mean  path 
lengths . 


V.  CONCLUSION 

In  the  case  of  plane-parallel  atmospheres,  even  with  vertical 
inhomogen iety,  several  methods  can  be  used  to  obtain  the  complete 
radiation  field  with  a good  accuracy  and  a reasonable  computer 
time.  If  only  the  flux  and  the  heating  rate  are  sought,  fast 
approximate  analytical  methods  are  available. 

When  more  realistic  atmospheric  models  must  be  considered, 
the  Monte  Carlo  can  be  used  with  a rather  good  accuracy,  but  at 
the  price  of  large  computation  times,  while  active  research  on 
faster  approximate  methods  is  carried  on. 
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source  function  for  thermal  emission 

sun  irradiance 

radiance 

radiance  matrix 

upward  radiance 

downward  radiance 

source  function 

absorption  coefficient 

absorption  coefficient  and  single  scattering  albedo 
in  the  continuum 

absorption  coefficient  and  single  scattering  albedo 
at  frequency  v 
extinction  coefficient 
phase  function 
phase  matrix 

coefficients  of  the  phase  matrix 

photon  path  length  distribution 
Stokes'  parameters 
position  variable 
earth  radius 

diffuse  reflection  function 
diffuse  transmission  function 
altitude 

Mie  parameter  (ratio  of  circumference  to  wavelength) 
zenith  angle 
cos  6 

sun  direction 
scattering  coefficient 
optical  depth 
total  optical  thickness 
azimuth  angle 
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= a/K  single  scattering  albedo 

ti  direction  variable 
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DISCUSSION 


Turner:  Do  you  know,  if  anyone  has  used  a method  similar  to  the 

spherical  harmonics  method  using  other  orthogonal  functions,  for 
example,  Gegenbauer  or  Jacobi  polynomials? 

Lenoble:  I have  never  heard  about  that  but' it  would  be  possible. 

Turner:  It  should  be  much  better  than  the  spherical  harmonics 

method  because  these  functions  more  nearly  represent  the  anisotropy 
of  the  scattering  phase  function  and  with  fewer  terms  in  the  series 
expansion. 

Lenoble:  Yes,  but  that  is  true  for  the  phase  function  alone - 

Turner:  Yes,  but  one  can  expand  the  radiation  field  function  in 

a series  of  these  more  general  functions. 

Irvine:  But  when  you  do  the  multiple  scattering^  it  is  convenient 

to  use  spherical  harmonics,  because  you  can  then  expand  the  inten- 
sity in  a Fourier  series  in  azimuth  and  each  azimuthal  component 
satisfies  an  independent  equation  of  transfer. 

Turner:  But  there  are  addition  formulas  for  these  other  poly- 

nomials although  they  are  considerably  more  complicated  than  those 
for  the  spherical  harmonics. 

Irvine:  I don't  know  of  any  that  is  being  done. 

Gal:  I would  like  to  answer  some  of  these  questions.  We,  at 

Lockheed  Palo  Alto  Research  Laboratory,  are  applying  the  Hartel 
formulation  for  multiple  scattering  by  spherical  particulates. 
Hartel  (Germany,  1941)  reported  that  multiple  scattering  may  be 
solved  by  obtaining  solution  for  the  scattering  function  by  suc- 
cessive scattering.  The  scattering  function  for  each  scattering 
order  may  be  written  in  terms  of  a Legendre  series  expansion. 

Hartel  did  not  have  the  mathematical  tools  to  prove  his  idea. 

Since  then,  the  Legendre  expansion  is  available  for  a spherical 
particle  and  my  colleague.  Dr.  Chou,  solved  the  radiation  transfer 
equation  for  a plane  parallel  geometry.  Agreement  with  currently 
available  solutions,  such  as  Dave's  iterative  and  Monte  Carlo 
methods  are  excellent.  The  advantage  of  our  solution  is  that  it 
requires  an  order  of  magnitude  less  computer  time.  This  work  will 
be  published  in  a few  months. 

Lenoble:  I would  like  to  have  it. 

Barkstrom:  Two  comments.  The  first  comment  is  that  in  stellar 

atmospheric  work  and  some  recent  work  which  we  have  done,  there  is 
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an  alternate  procedure  which  breaks  up  the  equation  of  transfer 
into  a flux  conservative  finite  difference  form.  The  computation 
time  is  comparable  with  that  of  the  doubling  method,  and  it  isn't 
bothered  by  vertically  inhomogeneous  atmospheres.  Secondly,  I am 
not  sure  which  approximate  analytical  methods  you  are  referring  to 
in  the  three-dimensional  case.  There  are  a momber  of  diffusion- 
type  approximations  that  are  related  to  the  delta-Eddington  approxi- 
mation. I expect  they  are  going  to  appear  in  the  literature  very 
shortly. 

Lenoble:  For  the  spherical  case,  I was  referring  mostly  to  the 
work  by  the  Soviet  group  of  Sobolev  and  Minin  and  this  kind  of 
work.  But  we  are  now  working  on  the  report  for  spherical  atmos- 
pheres and  for  three-dimentional  problems.  It  is  just  at  its  very 
beginning  and  I don't  have  much  information  on  this  work. 

Barkstrom:  These  methods  are  connected  with  problems,  such  as 

finite  clouds. 

Lenoble:  But  for  spherical  problems  you  mentioned  the  problem  of 

stellar  atmospheres.  It  is  quite  different  because  you  don't  have 
the  solar  illumination  so  you  have  really  a spherical  symmetry. 

Barkstrom:  It  doesn't  matter;  the  procedure  is  the  same. 

Unidentified  Speaker:  What  are  the  publication  plans  for  the 

report  you  are  preparing  for  the  Radiation  Commission? 

Lenoble:  Well,  the  first  draft  of  the  report  has  been  published. 

But  the  final  printing  is  to  be  done  at  NCAR.  Maybe  Professor 
London  can  give  some  more  infoimation  on  the  publication. 

Londoni  it  will  be  available  through  the  Radiation  Commission. 

Lenoble:  But,  presently,  it  is  only  the  part  concerning  the  plane 

parallel  atmospheres . 
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SOME  ASPECTS  OF  THE  INVERSION  PROBLEM 


IN  REMOTE  SENSING 


S • Twomey 

University  of  Arizona 


A brief  discussion  of  several  commonly  used  methods  for 
inversion — constrained  linear  inversion,  synthesis  (Backus- 
Gilbert)  methods  and  nonlinear  iterative  techniques  for  the 
Chahine  type — is  given.  It  is  demonstrated  that  a very  close 
connection  exists  between  BackUs-Gilbert  solutions  and  those 
given  by  constrained  linear  inversion. 

A number  of  examples  of  the  application  of  such  methods 
are  presented , showing  that  resolution  is  not  greatly  dif- 
ferent for  quite  different  algorithms — a result  quite  in 
accord  with  general  theoretical  considerations:  more 
"resolution"  can  be  achieved  at  the  expense  of  introducing 
greater  a priori  bias  in  the  procedure. 


INTRODUCTION 

When  Kaplan  (Ref.  1) , in  1959,  outlined  the  possibilities  of 
determining  atmospheric  structure  from  infrared  radiance  measure- 
ments, there  were  initially  probably  more  sceptics  than  enthusiasts 
Nevertheless,  the  steps  toward  implementation  proceeded  faster  than 
most  (at  least  in  terrestrial  atmospheric  physics)  and  it  was  only 
a few  years  later  that  a crude  (horizontal)  temperature  sounding 
experiment  was  carried  out  through  an  open  window  of  the 
Meteorological  Satellite  Center  in  Suitland,  Maryland-  At  that 
time,  inversion  methematics  tended  to  be  concentrated  around  the 
determination  of  coefficients  in  arbitrary  expansions  of  the 
unknown,  and  results  from  the  first  attempted  inversion  were  quite 
promising  but,  being  restricted  to  the  coefficients  of  a quadratic 
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not  capable  of  coming  very  close  to  reality — which  contained  sharp 
"step"  in  temperature  akin  to  an  atmospheric  thermal  inversion  and 
could,  therefore,  be  modeled  reasonably  only  if  two  inflections 
could  be  allowed  into  the  solution.  This  implied  a cubic  rather 
than  a quadratic,  and  a fairly  routine  improvement  of  the  inversion 
from  three  unknowns  to  four  was  undertaken  by  the  Suitland  group. 
Disaster  soon  ensued.  The  cubic  "improvement"  indicated  negative 
absolute  temperatures  and  super -adiabatic  lapse  rates,  a tempera- 
ture distribution  which  was  physically  unacceptable,  but  which,  if 
it  could  exist,  would  give  radiance  values  experimentally  undis- 
tinguishable  from  those  given  by  the  much  less  spectacular  tempera- 
ture structure  which  actually  existed. 

These  facts  are  mainly  of  historical  interest  and  are  well 
known  to  the  present  audience.  We  still  find  a great  deal  of 
attention  directed  to  procedures  and  algorithms  for  inversion,  the 
implication  being  that  a more  sophisticated  numerical  technology 
can  sidestep  the  obstacles  which  in  the  1960s  turned  modest  quad- 
ratic success  into  cubic  nonsense.  But  the  ambiguity  of  inversions 
is  fundamental,  caused  by  the  kernels,  which  describe  the  under- 
lying physical  connection  between  measured  and  sought  functions, 
and  a successful  algorithm  can  only  succeed  by  making  an  acceptable 
selection  from  all  the  possibilities-  That  selection  is  arbitrary; 
its  basis  does  not  lie  within  the  measurement  or  the  integral 
relationship  connecting  sought  distribution  with  the  measured 
quantities. 

The  simplest  version  of  the  indirect  sensing  problem: 
b 

g = / K (x) f (x)dx  (1) 

^ a ^ 

or 

g = /^  k(x)f  (x)dx 
a ““ 

describes  a kind  of  convolution  of  the  kernel  functions  with  the 
unknown  or  "sought"  function  f (x) - In  real  atmospheric  physics 
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problems,  K^(x)  is  fundamentally  smooth,  being  inmost  cases 
necessarily  positive  and  essentially  exponential  in  character,  of 
the  form  T.  g.e  or  / ^(u)e  with  5.  or  ij^(u)  non- 

negative.  Smoothness  of  implies  a diminishing  sensitivity 

of  g^  to  higher  frequency  components  in  f (x) , and  a fundamental, 
inherent  instability  and  ambiguity  in  any  inference  of  f (x) • This 
has  nothing  to  do  with  inversion  algorithms,  linear  or  nonlinear, 
simple  or  complicated:  they  do  not  even  have  to  be  brought  into 
the  picture  to  show  the  fundamental  ambiguities.  We  simply  cannot 
get  something  for  nothing. 

Figure  1 shows  two  curves  from  an  early  paper  by  Wark  and 
Fleming  (Ref.  2) ; it  shows  kernels  dt/d  In  p for  an  Elsasser  band 
with  maximum  contribution  at  100  mb  and  also  for  a constant  mass 
absorption  coefficient  (i.e.,  simple  exponential  behavior) . (1  bar 

= 100  kPa.)  Note  that  the  width  and  smoothness  of  the  two  are  com- 
parable— the  more  realistic  band  model  is  slightly  narrower  but 
not  markedly  so.  This  implies  that  a great  deal  can  be  learned  by 
looking  at  simple  exponential  kernels  (which  bring  us  to  a LaPlace 
transform  inversion  problem)  or  kernels  of  the  form  xe  (which 
have  the  advantage  that  they  attain  maxima  at  x = y ^ . 

— yX 

The  power  spectrum  of  the  kernel  xe  can  easily  be  cal- 
culated; it  decreases  rapidly  with  increasing  frequency,  asymptot- 
ically as  oj 

Figure  2 shows  the  Fourier  transform  of  a kernel  related  to 
the  indirect  measurement  of  particle  size  distribution  from  light 
scattering-  This  illustrates  an  important  and  often  overlooked 
point — that  a physical  kernel  may  have  in  its  spectrum  a blind 
spot  at  relatively  low  frequencies,  even  though  it  has  not  yet 
begun  its  ultimate  asymptotic  decline - 

Early  inversion  procedures  were  linear  in  nature  and  amounted 
to  the  inference  of  coefficients  in  some  expansion  of  f (x)  in  the 
form 
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f (x)  = a + hx  + cx^  + 
or 

f(x)  = + a^(f)^(x)  + a^cj)^(x)  + ... 

but  soon  use  of  the  values  fix.),  f(Xo)  •••  f(x  ) of  f (x)  at 

j.  z n 

selected  tabular  values  of  x became  ccxnmon  practice.  There  is 
fundamentally  no  difference  between  the  two  approaches  since  f .(x^ ) , 
f(x^),  etc.  can  be  identified  with  coefficients  in  an  expansion 
of  f (x)  in  terms  of  a set  of  functions  4>j^(x)  which  are  unity  at 
X = X and  fall  linearly  to  zero  at  x = x and  x = x . With 

K jC  1 X + 1 

further  development  of  inversion  algorithms,  there  tended  to  be  a 

return  to  the  expansion  approach  in  a somewhat  different  guise ^ 

in  which  empirical  orthogonal  functions  were  used  for  the  functions 

(|)^(x).  Use  of  such  functions^  coupled  with  constraints  resting  on 

the  fundamental  properties  of  empirical  orthogonal  functions  (EOF's), 

led  to  more  "realistic’*  inversions.  Nevertheless,  it  is  important 

to  realize  what  is  going  on  in  such  cases;  one  forms  a set  of 

orthogonal  functions  which  are  linear  combinations  of  observed 

functions;  these  latter  may  be  measured  with  instruments  of  high 

resolution  and  can  contain  harmonic  components  which  may  extend  to 

frequencies  to  which  a given  indirect  sensing  procedure  (with 

finite  accuracy,  smooth  band-limited  kernels)  is  ’’blind.” 

Furthermore ,’ orthogonality  of  the  functions  d>  (x)  , 6 (x)  does  not 

m n 

imply 

/k^(x)  - 4>j^(x)]dx  ^ 0 

so  two  independent  EOF’s  could  be  undistinguishable  to  the  remote 
sensing  measurement.  A stable  solution  is  possible  only  because, 
for  example,  one  demands  that  the  expansion  coefficients  fall  off 
at  a rate  similar  to  that  found  on  average  in  the  original  popu- 
lation of  measured  soundings  (e.g.,  a large  set  of  radiosonde  pro- 
files) . In  this  way,  a greater  degree  of  structure  can  be  retained 
in  a solution  but  it  is  in  a sense  pseudo-resolution — the  solution 
is  permitted  to  "wiggle"  in  a way  which  is  expected  on  the  basis  of 
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past  data.  This  kind  of  procedure  while  in  many  ways  eminently 
reasonable,  does  nevertheless  introduce  a strong  probability  of 
bias — for  example,  solutions  for  unpopulated  or  oceanic  areas 
(those  for  which  indirect  sensing  is  likely  to  be  most  valuable) 
in  essence  become  pushed  toward  a behavior  pattern  representative 
in  some  sense  of  more  populated  regions  where  direct  measurements 
are  most  frequent.  Since  people  and  radiosondes  are  simply  not 
randomly  distributed  over  the  globe,  the  distortions  produced 
through  such  bias  are  systematic  in  nature.  On  the  part  of  general- 
circulation  and  numerical  weather  prediction  researchers,  a degree 
of  disenchantment  with  satellite-based  temperature  soundings  seems 
to  have  emerged  recently;  it  is  germane  to  ask  how  much  the  afore- 
mentioned bias  contributes  to  this  disenchantment. 

It  is  also  worth  pointing  out  that  the  atmosphere  as  a whole 
is  generally  close  to  a balanced  condition;  one’s  primary  concern 
in  prediction  is  with  departures  from  that  balance.  There  is  no 
skill  needed  to  predict  cloudless  conditions  with  midday  tempera- 
tures near  the  80s  for  Arizona  in-  winter,  or  to  predict  showery 
conditions  with  steady  trade  winds  for  Hilo,  Hawaii.  It  is  the 
departures  from  these  norms  that  constitute  the  prediction  problem 
and  if  we  distort  temperature  profiles  toward  the  expected  norm,  it 
must  adversely  affect  the  chances  of  predicting  excursions  away 
from  the  norm. 


II.  RESUME  OF  COMMONLY  USED  METHODS 

A.  Constrained  Linear  Inversion 

This  consists  in  its  most  direct  application  of  the  conver- 
sion of  the  integral  equation  to  a quadrature  form  in  which  the 
function  f (x)  is  replaced  by  a vector  containing  tabular  values  of 
f (x)  in  its  elements  (the  behavior  of  f (x)  between  the  tabular 
points  being  implicitly  specified  by  the  quadrature  scheme) , and 
the  resulting  matrix -vector  equation  is  inverted  to  find  the  most 
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acceptable  vector  f,  which  satisfies  the  fundamental  relationship 
to  g to  within  a prescribed  accuracy.  "Most  acceptable"  is  almost 
always  specified  numerically  in  terms  of  minimization  of  some  quad- 
ratic form  f*Hf  in  f which  we  arbitrarily  introduce  to  gauge 
"acceptability."  In  general,  the  solution  is 

f ' = (A*A  + YH)"^  A*g  (2) 

We  obtain  a useful  solution  provided  H is  framed  to  be  in  some 
sense  a measure  of  the  smoothness  of  f.  One  can,  of  course,  take 
out  any  initial  guessed  or  known  expected  value  f^  and  remove  the 
corresponding  teim  Af^  from  g before  inverting.  The  shape  of  f^ 
is  arbitrary  and  if  it  contains  wiggles  or  other  high-frequency 
features  these  will  show  up  in  the  solution.  But  this  is  artificial 
in  the  sense  that  for  many  physical  kernels  these  could  be  filtered 
from  f^  without  changing  g perceptibly. 

Linear  constrained  inversion  methods  are  simple  extensions  of 
least-squares  methods  for  solving  systems  of  linear  equations. 
Methods,  in  which  f (x)  is  described  by  a vector  of  expansion  coef- 
ficients, are  algebraically  equivalent.  It  is,  however,  less  easy 
to  formulate  measures  of  acceptability  in  terms  of  such  coefficients 
if  the  expansions  are  polynomials  or  are  made  in  terms  of  arbitrary 
orthogonal  functions  (e.g.,  Fourier  or  Fourier-Bessel  expansions, 
Tchebycheff  polynomials,  etc.).  But  if, when  empirical  orthogonal 
functions  are  used,  one  has  a priori  statistical  grounds  for  asking 
the  coefficients  to  diminish  at  a known  rate  , this  provides  a 
valuable  method  of  inversion  in  situations  where  a sizable  back- 
ground of  data  measurements  for  f (x)  is  available.  In  this  pro- 
cedure, one  is  asking  that  the  solution  f * (x)  be  "like"  the  popu- 
lation from  which  the  orthogonal  function  set  was  constructed. 

B.  Synthesis  Methods 

All  linear  methods  of  solution  produce  a solutuion  f’(x)  which 
for  any  value  of  x consists  of  a linear  combination  of  the  measured 
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g' s,  i.e, , 


(3) 


The  array  is  often  not  calculated  explicitly,  as  in  the  case 


of  constrained  linear  inversion  methods,  in  which 


given  by  (A*A  + yH)  ^A*. 


kj' 


would  be 


The  Backus-Gilbert  (Ref.  3)  solution 

’ b . and  the  con- 

k: 

Equation  (2)  has  the 


directed  its  attention  explicitly  to  the  array  b^^  and  the  con- 
straints are  formulated  in  terms  of  the  b. 


kj‘ 

consequence  that  the  solution  value  f*{x^)  can  be  written 
b 


= 


y . b .K . (x)  1 f (x)dx 

k:  D 


so  that  E,  b .K.(x)  is  a "scanning  function"  which,  of  course, 
k k3  3 

would  have  to  be  the  delta  function  6 (x  - x ) if  f * (x  ) was  to 

iC  JC 

reproduce  ^ exactly.  That  is  an  evident  impossibility  and  the 

linear  inversion  problem  can  be  regarded  as  a search  for  realizable 
scanning  functions  which  approximate  the  delta  function  and  are  as 
far  as  possible  free  from  side  lobes  and  other  undesirable  features 
which  could  distort  f*(x)  and  produce  in  it  artificial  peaks  or 
troughs.  It  is  sufficient  to  consider  only  normalized  scanning 
functions,  so  that 


S,  (x)dx  = 1 


Since  the  coefficients  b ultimately  multiply  the  errors  in  g, 

iC  ^ 

stability  is  ensured  if  the  magnitude  of  ” I ^ 

limited.  To  complete  the  Backus-Gilbert  procedure,  one,  therefore, 

needs  to  solve  a minimization  problem  which  is  very  similar  to  that' 

encountered  in  constrained  linear  inversion.  An  array  of  coef- 

ficients  b is  to  be  determined  which  minimizes  a spread  b , S b 

measured  about  the  point  x = x , with  b*  b < Constant  and 

X X X 

bj^*k^  also  constant  (k^  being  written  here  for  the  array 

K. (x)dx,  j = 1,  2,  3 ...  m).  The  two  constraints  (one  restric- 
a 3 

ting  the  magnitude  of  the  coefficients,  the  other  a normalization 
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constraint)  imply  two  Lagrangian  multipliers,  but  the  solution 
algebra  is  again  quite  straightforward  and  we  obtain 

-1 


(4) 


B ^ must  be  k*  (S  + yi) 
/ (x  - Xj^)  (x)dx,  is 


3 is  readily  calculated  explicitly — since  k^*b.  must  be  unity. 

For  the  particular  spread  measure 
- X.  ) ^K.  (x)K.  (x)dx  II  , and  Y plays 

K X J 

a role  similar  to  that  played  by  y in  constrained  linear  inversion, 
in  that  as  y decreases  in  size,  error  magnification  increases  and 
the  scanning  function  becomes  narrower,  with  diminishing  y. 
Backus-Gilbert  solutions  involve  covariance  matrices  C = 

II  /k.  (x)K.  (x)dx  II  and  vectors  such  as  |j/  K.  (x)dx  |[  and  ||  K.  (x  ) j|. 

To  the  accuracy  of  quadrature,  these  quantities  are  connected  with 
those  occurring  in  constrained  linear  inversion  through  relation- 
ships C = AY,  k^  = II  /K^(x)dx  II  = Ae,  and  so  on.  Y is  written  for 
the  m X n tabulation  ||  (x ^ ) ||  of  the  kernels.  By  these  relation- 
ships, Backus-Gilbert  solutions  can  be  transformed  into  relation- 
ships involving  the  quadrature  matrix  A and  the  measured  g,  and  a 
very  close  relationship  can  be  established  between  solutions 
obtained  by  the  two  methods-  One  simple  such  relationship  applies 
to  the  direct  inverse  f*  = A ^g,  which  can  also  be  obtained  by  the 
Gilbert-Backu.s  procedure  if  the  square  noirm  of  the  difference 
between  the  delta  function  and  the  scanning  function  is  the  basis 
for  optimization;  carrying  out  this  optdLmization  (unconstrained) 
one  obtains  a solution  which  the  substitution  C = AY  transforms 
into  the  direct  inverse  A ^g. 


The  trade-off  curves  provided  automatically  in  Backus-Gilbert 
inversion  shows  how  much  accuracy  of  measurement  is  needed  to  give 
a certain  resolution.  As  we  change  y in  Eq.  (4) , the  width  of  the 
scanning  function  and  the  magnitude  of  the  b-vectors  change;  the 
latter  magnitude  determines  how  much  error  magnification  can  enter 
into  the  solution  (which  is  f^*  = error  mag- 

nification can  be  plotted  against  resolution.  Such  a plot,  for 
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WIDTH  OF  SCANNING  FUNCTION 

Fig-  3.  Trade-off  curves  by  Backus-Gilbert  procedure  for 
—ux 

kernels  of  the  form  xe  . Several  values  of  y,  giving  maxima  at 
X - 0.2,  0,4,  0,6,  and  0,8,  are  represented. 

“VX 

kernel  xe  is  shown  in  Fig.  3.  It  is  apparent  that  beyond  a 
certain  point  (one-half  width  of  scanning  function  ~ 0.2  times  the 
interval  of  integration) , a huge  increase  in  accuracy  is  needed  to 
produce  even  a small  increase  in  resolution.  This  is  not  an  arti- 
fact of  the  Backus-Gilbert  method  (Ref.  3) — it  is  a property  of  the 
kernels  involved  and,  furthermore,  is  not  significantly  influenced 
by  the  number  of  kernels  used,  provided  that  a reasonable  number 
are  employed  (7  to  10,  say) . Similar  plots  can  be  made  of  resolu- 
tion against  mamber  of  kernels  (for  a fixed  error  magnification)  and 
they  show  a rapidly  diminishing  return  with  further  increase  in 
number  of  kernels. 


II 


C,  Statistical  Inversion  Methods 

Quite  apart  from  linear  inversion  techniques  with  constraints 
based  on  statistics  (e.g.,  EOF  expansions),  statistics  find  appli- 
cation even  more  directly.  One  sees  in  inversion  linterature  a 
very  natural  tendency  to  use  comparisons  between  "measured"  data 
(radiosonde  soundings,  for  instance)  and  data  given  by  inversion 
as  a basis  for  judging  the  relative  merits  of  the  various  numerical 
procedures.  If  we  can  make  enough  simultaneous  measurements  of 
both  functions  g(y)  and  f (x) , then  the  relationship  between  g(y) 
and  f (x)  in  principle  can  be  "learned"  from  these  measurements, 
provided  only  that  it  is  physically  reasonable  to  assume  that  the 
f"^g  relationship  is  linear  to  a good  approximation-  This  is 
essentially  the  procedure  followed  in  certain  kinds  of  "statistical" 
inversions  where  a matrix  (B,  say)  is  inferred  from  a set  of 
paired  distributions  f and  g.  One  is  seeking  to  learn  B,  where 

f = B g + e (5) 

hxi  nxmmxi 

and  a number  of  fairly  obvious  minimization  approaches  can  be  fol- 
lowed to  minimize  |e|^  or  some  other  suitable  norm.  One  can,  for 
example,  for  any  specified  x = ^ (i.e.,  elemental  position  within 

-K 

f) , solve  for  the  kth  row  of  B,  imposing  as  usual  a constraint  for 

stability.  Thus,  if  the  kth  row  of  B is  b , we  have 

^ ~k 

‘it  ■ ‘ ' It  ' ^It's  (6) 

for  every  set  of  m measurements.  If  f denotes  the  value  of  f 

Ik  k 

from  the  ith  measurement  and  g^  is  the  corresponding  array  of  g- 
values,  one  can  write 


51 


I 


II  II  I nil  II II  III  II  Hill  I III 


nil 


I 


I I I 


■< 

?1 



?2 

■< 

?3 

> 

L X m 


' 

f 

1 

^Ik 

1 

?k 

• 

< 

> 

1 ^mk^ 

(7) 


and  solve  b,  . For  each  k (row  of  B)  the  procedure  can  be  repeated 

a. 

and  thereby  a complete  matrix  B generated.  The  constraint  b = 

Constant  ensures  error  magnification,  the  constraint 

(variance  of  f^)  relates  the  permitted  variation  in  the  solution 

to  the  observed  variation  at  the  kth  level . Rather  than  computing 

B row  by  row,  it  can  be  obtained  in  toto  if  the  sets  of  n- 
o- 

dimensioned  vectors  f and  of  m-dimensioned  vectors  g are  collected 
into  matrices.  One  version  of  this  procedure  considers  a set  of 
measured  vectors  f ^ and  associated  measured  g^ , g2/ 

...  g^  with 


(8) 


= ? 

^ Or 

?1 

fp  = B 
~2 

?2  ^ 
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+ e 
n X 1 


which  gives 


B = G ^ P; 

^ ^\j  '\j 


[ F - g(y^)f  (x  J ; ^ = g{y^)g(y_^)  ] 


(9) 


F is  essentially  the  cross-covariance  matrix  of  the  f*s  and  g*s 


a. 


while  G is  the  covariance  matrix  of  the  g's.  Again,  constraints 


must  be  applied  to  bring  about  stable  solutions  in  which  the  indi- 
vidual elements  of  B are  not  allowed  to  become  large  and  exces- 
sively  oscillatory.  This  can  be  done  in  exactly  the  same  way  as 
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is  done  in  the  case  of  vector  equations*  It  is  the  r6ws  of  B that 

% 

need  individually  to  be  constrained  and  this  is  most  easily  done 
if  the  equation  for  b the  rth  row  of  B can  be  written; 

'^r  f\j 


G b = f + e 
-r  ~r  -r 


G 


gi(y±) 


f 

-r 


f 1 (x^) 


(10) 


which  can  be  solved  under  the  constraint  b *b  = constant  by 

r r 

b = (G  * G + yD”^  G*f 
~r  ~r 

If  more  should  be  known  about  the  error  statistics,  they  can  be 
incorporated  into  the  solution  process  at  the  expense  of  a little 
additional  complexity. 


D.  Nonlinear  Iterative  Methods 

Linear  iterative  methods  have  found  application,  but  they  are 
not  fundamentally  different  from  other  linear  methods.  Nonlinear 
iterative  methods,,  on  the  other  hand,  are  different  and  appear  to  be 
capable  of  giving  good  inversions  where  more  direct  methods  have 
difficulty.  Such  methods  were  applied  to  atmospheric  sounding  by 
Chahine  (Ref.  4)  and  have  been  successfully  used  in  cloud-  and 
particle-size  distribution  problems  and  elsewhere.  They  consist 
of  taking  a first  guess  for  the  unknown  f (x)  and  then  modi- 

fying it  so  as  to  improve  the  discrepancies  between  the  measured 
g^  and  9^  = / K^(x)f(x)dx.  In  some  applications  the  entire  set  of 
gj^  are  calculated  and  the  first  guess  (or  subsequent  iterate)  is 
updated  simultaneously  at  all  tabular  x-values;  in  other  appli- 
cations the  first  guess  and  all  subsequent  iterates  are  updated 
as  soon  as  one  value  for  f*  has  been  calculated.  There  does  not 
appear  to  be  any  clear-cut  advantage  of  one  procedure  over  the 
other . 
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Adjustment  methods  in  nonlinear  iterative  algorithms  rely 
on  the  principle  that  the  change  in  g^  resulting  from  an  adjust- 
ment in  f (x)  around  x = 5 is  proportional  to  the  value  of  K^(5). 

In  some  algorithms  the  value  of  f (x)  is  changed  only  where  K^(5) 
is  greatest,  but  if  the  tabular  intervals  are  closely  spaced,  high 
frequencies  may  thereby  be  introduced  in  proportions  that  can 
become  excessive.  An  alternative,  which  in  the  writer's  experience 
can  be  superior,  is  to  make  the  change  at  x = 5 proportional  to  the 
value  of  the  kernel  there.  A useful  algorithm  based  on  this 
principle  is  as  follows:  given  an  iterate  / this  is  adjusted 

by  comparing  g^  and  g^  = K^(x)f^(x)dx  and  computing  the  new 
iterate  as 


4. 

m + 1 m 


g.  - gi 

1 + K (x) 

^ - 


(11) 


This  makes  the  greatest  proportional  change  where  (x)  is  greatest 
and  makes  no  change  where  is  zero. 

This  and  several  other  similar  algorithms  have  been  used  on 
a variety  of  problems  and  generally  show  excellent  stability  and 
independence  of  the  first  guess  (see  Fig.  4) . With  nonnegative 
kernels  and  positive  first  guess,  no  iterate  will  become  negative 
anywhere  provided  none  of  the  g values  are  zero  and  max  {k^(x)}  < 1, 
a condition  which  can  always  be  ensured  by  appropriate  scaling.  One, 
therefore,  has  a nonnegativity  constraint  built  into  the  algorithm 
and  there  is  little  doubt  that  that  is  a major  influence  on  the 
stability  of  the  inversion.  Since  products  of  kernels  are  generated 
as  the  iteration  proceeds,  the  bandwidth  of  admissible  frequencies 
increases  rapidly  whatever  the  first  guess,  but  this  is  a second- 
order  process  and  in  practice  high  frequency  oscillations  rarely 
seem  to  be  a problem  in  inversions  by  this  method  (whether  it  be 
applied  to  computed  data  or  real  measurements) . 

The  relative  change  made  in  f(x)  at  every  step  can  be  kept 
small  simply  by  limiting  it  within  the  algorithm;  to  the  first 
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Fig,  4,  Iterative  nonlinear  inversion  on  a particle  sizing 
problem.  Solid  line  and  open  circles  refer  to  the  two  very  dif- 
ferent first  guesses  f^  (x)  (shown  dashed) , 

order  the  solution  thus  produced  is  a linear  combination  of  the 
kernel  functions.  Such  a solution  can  be  generated  by  linear 
methods  also  — e.g.,  we  write  f(x)  = g^K^(x),  and  solve  for  5- 

The  equation  for  5 is 

C5  = g + e 

C being  the  kernel  covariance  matrix.  The  connection  between  non- 
linear iterative  solutions,  nonlinear  iterative  solutions  with 
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limited  adjustments  and  direct  linear  methods  might  usefully  be 
investigated  by  systematically  comparing  results  given  by  the 
three  methods  for  the  same  problem.  The  vTriter  is  not  aware  of 
this  having  been  done  at  the  time  of  writing. 


III.  ASSESSMENT  OF  THE  VARIOUS  PROCEDURES 

A.  Comparison  of  Results  on  a Standard  Problem 

There  are  many  ways  of  doing  this  and  they  will  not  always 
give  the  same  answer.  It  is  nevertheless  informative  to  take  a 
very  simple  problem  and  solve  it  in  a number  of  ways.  This  has 
been  done  as  follows: 

= Jq  K^(x)f(x)dx  (i  = 1,  2,  ...  N;  N = 20) 

K^(x)  = xe  (y^  = 0.1,  0.2,  ...  1.0,  1.2,  1.4,  ..., 

2.0,  2.5,  3.0,  3.5,  4,  5,  ...  10) 

f(x)  = 1 - 4(x  - 1/2)^ 

With  this  integrand,  a simple  weighted  trapezoidal  quadrature  gave 
the  g^^  with  an  rms  error  of  0.1%  over  all  y-values.  This  problem 
has  been  solved  in  a variety  of  ways  and  the  solutions  are  shown 
on  the  Fig.  5.  They  are: 

a.  Constrained  linear  inversion  with  minimum  constraint  H = 

I (i.e.,  minimum  "power"). 

b.  Backus-Gilbert  inversion,  error  magnification  - 100 
(Ref.  3). 

c.  Chahine’s  original  iterative  method,  only  one  ordinate 
adjusted  per  step  (Ref.  4) . 

d.  A modification  of  the  latter,  as  described  above - 

e.  Constrained  linear  inversion  with  end  points  fixed  at 
their  correct  values  [f(0)  = f(l)  = 2]. 

f.  Constrained  linear  inversion  with  end  points  fixed  at 
approximately  their  correct  values  f(0)  = 1,95,  f(l)  = 2.05. 
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There  is  no  point  whatever  in  trying  to  grade  the  methods  on 
the  basis  of  these  tests.  The  results  are  shown  mainly  to  demon- 
strate that  reasonable  inversions  can  be  obtained  by  each  of  the 
methods  and  that  rather  dramatic  improvements  (such  as  from 
Figs.  5(a)  to  5(e))  can  be  achieved  when  we  bring  in  a priori 
knowledge.  But,  as  Fig.  5(f)  shows,  we  must  be  sure  of  this 
a priori  knowledge;  otherwise,  more  harm  than  good  is  done  by  its 
incorporation . 

B.  Resolution 

There  is  a very  simple  method  whereby  we  can  make  estimates 

of  the  resolution  of  an  inversion  procedure  in  toto . To  do  this, 

we  simply  take  the  array  of  g^  values  corresponding  to  the  special 

case  f(x)  = 6 (x  - Xq)  and  applied  the  inversion  algorithm  to  that 

g-vector,  obtaining  a distribution  which  we  call  2:  (x^)  . A perfect 

inversion  procedure  would  return  6 (x  - x^)  when  given  as  input  the 

g-vector  (k (x  ) , i . e . , (K, (x  ) , (x  ) , . . . K (x  ) ) , and  the  extent 
z -o  i o z o mo 

to  which  any  procedure  fails  to  do  this  is  a useful  objective 
gauge  of  its  resolution.  (It  should  be  emphasized  that  "procedure" 
here  encompasses  the  set  of  kernels  which  are  used,  which  play  a 
more  important  role  than  the  algorithm  itself  in  determining 
resolution  or  lack  of  it.) 

Some  examples  of  resolution  tests  by  this  method  are  shown  in 
Fig.  6.  We  also  have  included  in  the  figure  Backus-Gilbert  (Ref.  3) 
scanning  functions  for  the  same  set  of  kernels  and  it  should  be 
noted  that  for  linear  inversion  methods  the  result  given  by  the 
above  6 function  test  is  in  fact  a scanning  function,  i.e.,  the 
solution  obtained  is  a convolution  of  that  function  with  the  origi- 
nal f(x).  In  the  case  of  nonlinear  inversions,  that  does  not  hold 
true,  but  the  significance  of  the  distribution  is  still 

clear . 
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of  the  delta  functions.  Kernels  of  form  xe 


C.  Merits  and  Disadvantages 

Each  of  the  procedures  possesses  certain  merits.  The  con- 
strained linear  inversion  is  fast,  especially  when  a large  number 
of  observed  g vectors  are  to  be  processed  in  the  same  way;  it  is 
objective  when  smoothness  constraints  are  employed,  and  it  allows 
anything  which  we  may  know  about  f (x)  to  be  built  into  the  con- 
straint. The  Backus-Gilbert  (Ref.  3)  method  is  aesthetically 
appealing  in  that  the  scanning  function  can  be  calculated  and 
graphed.  Statistical  techniques  give  more  realistic  soundings 
than  the  other  methods.  Nonlinear  iterative  methods  avoid  the 
necessity  to  formulate  explicit  constraints.  They  can  accommodate 
a wide  range  of  magnitude  in  f (x)  and  they  seem  generally  to  be 
superior  with  respect  to  resolution,  and  tolerance  of  errors. 


IV.  CONCLUDING  REMARKS 

The  preceding  discussion  was  not  intended  to  be  a step-by-step 
review  of  inversion  problems  and  algorithms-  Rather,  a number  of 
selected  points  were  given  attention  primarily  in  order  to  demon- 
strate that  there  is  not  a great  deal  of  difference  fundamentally 
between  the  various  procedures.  The  differences  lie  rather  in  the 
basic  selection  process  (without  which  no  stable  solution  can  be 
obtained).  If  we  decide  to  select  the  smoothest  solution,  v/e 
necessarily  damp  out  features  such  as  the  tropopause;  if  we  select 
the  solution  which  is  closest  to  some  statistical  expectation,  such 
features  will  appear,  but  not  necessarily  in  the  right  place;  we 
may  also  thereby  fail  to  see  rare  but  real  and  important  excursions. 
It  is  misleading  to  judge  the  result  of  an  inversion  on  the  basis 
of  "reasonableness."  Indeed,  in  the  case  of  such  things  as  temper- 
ature and  water  vapor  soundings^ it  seems  more  appropriate  that  the 
selection  of  "most  suitable"  constraints  and  filterings  be  based 
on  what  is  going  to  be  done  with  them,  not  on  some  subjective 
judgment  which  can  be  greatly  influenced  by  the  scaling,  etc., 
chosen  to  display,  in  some  form  or  other,  "true"  and  "computed" 
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soundings,  thickness  plots  or  whatever.  Nobody  really  cares  what 
the  temperature  is  over  some  point  at  9 km  (30,000  ft)  in  mid-Pacific 
nor  for  that  matter  what  the  200  mb  thickness  is  there  or  how  much 
water  vapor  resides  above  a particular  level.  These  quantities 
primarily  provide  input  for  analysis  and  prediction  procedures. 

Many  of  these  procedures  are  fundamentally  nonlinear  but  may, 
perhaps,  be  approximated  closely  by  a linear  operation.  In  a 
very  qualitative  and  general  way,  one  might  look  at  the  prediction 
of  tomorrow's  wind  field  in  the  Pacific  as  an  operation  on  today's 
field  and  today's  satellite  radiance  data.  If  today's  and 
tomorrow's  fields  were  accurately  known,  statistical  or  other 
techniques  could  be  utilized  to  optimize  the  prediction,  virtually 
ignoring  the  physics  involved.  If  this  is  not  possible,  at  least 
the  selection  of  "best"  from  the  large  population  of  profiles 
which  are  all  equally  acceptable  from  the  radiance  point  of  view 
should  surely  be  based  as  much  as  possible  on  the  intended  appli- 
cation of  the  soundings,  and  users  of  inversion  data  should  be 
more  actively  brought  into  the  selection  process. 

As  was  mentioned  earlier,  appreciable  improvement  on  resolu- 
tion by  improving  the  accuracy  of  measurement  is  hardly  practicable 
since  a very  great  improvement  in  accuracy  produces  only  a very 
modest  improvement  in  resolution.  There  does  appear  to  be  room 
for  improvement  in  accuracy  so  far  as  the  kernels  are  concerned. 


SYMBOLS 


k(x)  kernel  function 

k^(x)  elements  of  the  kernel  function 

f(x)  "sought"  function 

g function  representing  measured  quantities 

t time 

p pressure 
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DISCUSSIONS 


Chahlne:  You  have  made  a flat  statement  that  this  type  of  integral 

equation -has  an  infinite  number  of  solutions,  Do  you  say  that  this 
is  true  for  any  kernel  and  for  any  foinn  of  the  given  function, 
whether  it  is  continuous  or  discreet? 

Twomey:  One  should,  perhaps,  modify  that  to,  say,  physical 

kernels,  the  kernel  which  one  is  likely  to  encounter  in  a physical 
measurement.  It  would  certainly  not  be  true  for  many  mathematical 
kernels,  such  as  delta  functions. 

Chahlne:  Not  necessarily.  Exponential  kernels  can  give  you  a 

unique  solution  sometimes . 

Twomey:  Well,  you  can  demonstrate  in  the  presence  of  finite 

accuracy — you  only  have  to  go  to  a high  enough  frequency  and  you 
get  ambiguity. 

Chahlne:  The  nonuniqueness  can  be  due  to  noise  in  the  given 

function  (data)  or  in  the  kernel-  But  it  is  not  necessarily  an 
intrinsic  property  of  this  integral  equation.  I would  like  to 
make  this  point  clear. 

Twomey:  Yes.  I would  agree.  However,  I would  comment  that  I 

think  from  the  point  of  view  of  keeping  oneself  out  of  trouble, 
one  would  be  better  off  to  feel  that  there  are  an  infinity  of 
solutions  than  one  would  be  to  think  there  is  a unique  solution. 

Chahlne : Physically? 

Twomey:  Yes,  from  the  point  of  keeping  oneself  out  of  trouble. 

You  can't  get  into  trouble  mathematically. 

King:  Don't  you  think  that  part  of  the  problem  of  oversell  is 

related  to  formatting?  Inversion  efforts,  thus  far,  have  been 
directed  toward  mimicking  the  infinitely  resolved  temperature  pro- 
file given  by  the  radiosonde-  And  this,  of  course,  the  inversion 
is  incapable  of  doing  in  principle.  Isn't  it  preferable  to  format 
the  information  in  terms  of,  say,  the  six  parameters  that  one  could 
infer  from  six  radiance  observations?  These  six  gross  atmospheric 
structure  parameters  would,  perhaps,  be  of  more  use  to  numerical 
weather  predictors  anyway,  since  their  algorithms  are  not  really 
sensitive  to  the  kind  of  detail  provided  by  the  radiosonde. 

Twomey:  I agree  100  percent,  Jean,  yes.  I think  we  should  even 

go  a step  further  than  this  because  (I  think  I said  it  in  the 
written  version  of  this  paper),  I don't  think  any  of  us  really 
care  what  the  temperature  is  200  millibars  above  Einewtok  or 
somewhere  like  that.  It  is  only  data  that  is  going  into  a 
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circulation  model  or  a prediction  model  and  so  forth.  The  first 
thing  those  people  are  going  to  do  is  try  to  apply  some  of  their 
own  numerical  processes  to  that.  I mean,  for  example,  if  you  feed 
them  a superadiabatic  lapse  rate,  they  are  going  to  make  a con- 
vective adjustment  to  it.  So  the  first  thing  they  are  going  to  do 
with  a nice  temperature  profile  which  you  have  at  great  troiible 
and  expense  fashioned,  they  are  going  to  change  it.  So  I think  you 
should  look  at  that  and  incorporate  that  and  see  what  is  best  for 
them.  It  may  be  that  some  of  the  things  inversion  people  are 
sweating  to  try  to  put  into  their  profiles  are  not  actually  needed 
by  the  people  who  are  going  to  use  those  profiles.  I think  it  is 
also  up  to  the  people  who  are  going  to  use  those  profiles  to  state 
what  they  want,  not  the  way  they  have  been  stating  it.  I think 
when  Dave  Wark  and  I talked  to  people  some  years  ago,  they  would 
say,  oh,  we  want  a quarter  of  degree;  we  want  at  least  as  good  as 
the  radiosonde;  we  want  better  than  the  radiosonde.  But  they  have 
a grid  in  which  the  whole  atmosphere  is  in  about  160  km  (100  mi) 
boxes.  So  they  are  only  putting  themselves  on  if  they  say  they 
want  radiosondes  every  100  meters  and  with  accuracy  of  0.3048  (1  ft) 
resolution,  because  they  are  not  going  to  be  able  to  handle  it. 

Kaplan:  Well,  if  we  get  a superadiabatic  lapse  rate, we  better  not 

feed  it  to  them  to  make  the  convective  adjustment,  because  we  are 
supposed  to  be  sounding  the  atmosphere.  And  we  better  do  the 
adjustment  because  we  know  how  to  do  it  better  than  they  do.  And 
it  isn't  the  convective  adjustment  that  is  required.  There  are 
real  needs  and  I think  we  have  to  try  to  answer  those  needs.  And 
I think  probably  what  is  necessary  is  to  see  what  it  is  that  really 
is  wanted  and  probably  what  is  wanted  in  numerical  weather  pre- 
diction is  mean  temperatures  between  constant  pressure  levels  or 
constant  height  levels.  And,  I think  we  ought  to  see  whether  that 
can  be  produced  and  the  solutions  given  in  those  forms.  And  if  it 
can't,  there  should  be  a negotiation  going  back  and  forth  and  trade- 
offs between  the  people  doing  the  soundings  and  the  users,  because 
the  soundings  are  not  being  done  for  its  own  sake.  I mean,  some 
of  us  may  like  the  idea  of  the  game,  but  the  real  purpose  is  being 
able  to  improve  the  weather  forecast. 

Goldman:  I was  wondering  if  you  could  expand  your  opinion  about 

the  accuracy  of  the  constraint  against  no  constraint  at  all?  You 
have  shown  in  your  example  that  small  error  in  the  constraint  can 
give  much  worse  inversion.  T^d  I was  wondering  if  you  could 
expand  a little  bit  on  this  by  some  actual  examples? 

Twomey:  Well,  no,  I can't-  I have  that  example  there,  but  one 

could  go  on  and  on,  I mean,  calculate  various  examples.  I think 
it  is  fairly  obvious  what  is  going  on.  If  there  are  real  con- 
straints, which  there  are  in  many  physical  problems,  of  course, 
especially  at  the  endpoints  of  your  interval,  there  is  likely  to 
be  some  kind  of  physical  or  essential  constraint  there  quite 
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outside  your  inversion  problem^  If  it  is  of  that  nature  and  you 
can  state  accurately  what  it  is,  you  can  only  benefit  by  putting 
it  in.  But  just  exactly  how  much  error  you  need  to  be  in  that 
constraint  before  it  is  doing  more  harm  than  good,  that  would 
obviously  depend  on  your  particular  problem.  You  know  the  pre- 
cise numbers  that  you  are  working  on.  It  is  a simple  thing  to 
test,  obviously. 
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GENERALIZATION  OF  THE  RELAXATION  METHOD  FOR 
THE  INVERSE  SOLUTION  OF  NONLINEAR  AND 
LINEAR  TRANSFER  EQUATIONS 


Moustafa  T-  Chahine 
Jet  Propulsion  Laboratory 
California  Institute  of  Technology 


A mapping  transformation  is  derived  for  the  inverse 
solution  of  nonlinear  and  linear  integral  equations  of 
the  types  encountered  in  remote  sounding  studies  - The 
method  is  applied  to  the  solution  of  specific  problems 
for  the  determination  of  the  thermal  and  composition 
structure  of  planetary  atmospheres  from  a knowledge  of 
their  upwelling  radiance. 


I.  INTRODUCTION 

The  problem  of  determining  an  unknown  function  g(z)  from  the 
integral  equation 

I (V)  = W g (z) 

given  the  function  I (v)  and  the  integral  operator  N , has  arisen 
repeatedly  in  problems  of  radiative  transfer  as  in  Ref-  1 and  in 
transport  theory  as  in  Ref-  2.  The  mathematical  problem  here  is 
a difficult  one  and,  in  fact,  may  not  always  have  a solution  for 
an  arbitrary  function  I(v)-  The  difficulties  are  compounded  by 
the  facts  that  I(v)  is  obtained  in  general  from  measurements  con- 
taminated with  noise  and  that  the  integral  operator  itself  is  an 
approximation  to  a real  physical  process* 

In  this  paper,  we  treat  a specific  class  of  linear  and  non- 
linear integral  equations  in  which  the  kernel  has  the  important 
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property  of  reaching  its  maximum  peak  at  different  values  of  z for 
different  values  of  v.  This  mathematical  property  is  very  common 
in  transfer  and  transport  problems  where  the  dominant  physical 
process  takes  place  within  a narrow  segment  of  the  range  of  inte- 
gration. 

In  the  following  section,  we  will  develop  a relaxation  method 
of  solution  based  on  the  principle  of  mapping  transformations  to 
recover 

g(z)  =W  ^I(v) 

and  study  the  stability  and  accuracy  of  the  solutions  in  the 
presence  of  noise  in  the  given  data.  The  remaining  sections  of 
this  paper  will  be  devoted  to  the  study  of  the  inverse  problems 
for  the  determination  of  atmospheric  temperature  and  composition 
profiles  from  remote  sounding  radiance  data. 


II.  MATHEMATICAL  FORMULATION 

The  formulation  of  remote  sounding  problems  in  radiative 
transfer  leads  often  to  nonlinear  integral  equations  of  the  foim 


rz 


I(v)  = B[v,g(z^)]  C(v,z^)  + 


B[v,g(z) ] K(v,z)  dz 


(1) 


In  some  cases,  however,  the  resulting  integral  equations  are 
linear  in  g(z)  and  of  the  form 


i(v)  = g(z^)  c(v,z^) 


K(v,z) 


dz 


(2) 


In  Eqs.  (1)  and  (2)  C(v,z^)  and  K(v,z)  describe  specific 
radiative  transfer  processes  such  as  absorption,  emission,  or  scat- 
tering in  the  atmosphere.  In  many  problems  of  remote  sounding, 
this  kernal  K(v,z)  reaches  its  maximum  peak  at  different  values  of 
z for  different  values  of  v.  I(v)  is  a given  function,  usually 
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measured  at  a discrete  number  of  observations  Kv^),  and  g(z)  is 
the  distribution  of  an  atmospheric  parameter  to  be  determined. 

Thus,  the  inverse  solution  here  reduces  to  finding  a function  g(z) 
such  that  when  it  is  substituted  into  Eqs.  (1)  or  (2) , it  will  yield 


with 


i(Vj)  - I(v^)  = 0 
for  all  the  given  values  of  v ^ . 

In  remote  sounding  problems  there  is  usually  no  doubt  of  the 
physical  existence  of  a solution  or  perhaps  even  of  its  uniqueness. 
But  from  a mathematical  point  of  view,  the  problems  of  demon- 
strating existence  and  ensuring  uniqueness  of  g(z)  are  of  great 
importance  and  are  directly  related  to  the  various  simplifying 
physical  and  mathematical  assumptions  made  in  the  derivation  of 
the  integral  equation  and  depend  also  on  the  information  content 
of  the  measured  data.  Therefore,  a general  treatment  of  the  prob- 
lems of  existence  and  uniqueness  for  Eqs.  (1)  and  (2)  is  difficult 
unless  it  is  approached  from  the  narrow  point  of  view  of  the 
dependence  of  the  solution  on  the  initial  guess  and  the  inter- 
polation (or  quadrative)  method  used.  However,  accurate  demon- 
stration of  existence  and  uniqueness  may  be  carried  out,  in 
principle,  for  some  problems,  depending  on  the  form  of  the  kernel 
and  the  degree  of  discretization  of  the  function  i(Vj)* 

A.  General  Method  of  Solution 

The  right-hand  side  of  Eqs.  (1)  and  (2)  may  be  viewed  as  an 
integral  operation  transforming  variations  of  g with  respect  to 
z into  variations  of  I with  respect  to  v as 

I(v)  = Wg(z).  (3) 

To  obtain  g(z)  we  need,  therefore,  to  perform  an  inverse  trans- 
formation from  the  (I,v)  plaine  into  the  (g,z)  plane  with 
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(4) 


g(z)  = W ^ I(v) 

We  can  accomplish  this  by  applying  a mapping  transformation  which 
maps  points  on  the  v-axis  into  corresponding  points  on  the  z-axis 
and  similarly  maps  points  on  the  I-axis  into  points  on  the  g-axis. 

1,  The  V - z Mapping  Transformation 


To  map  the  v-axis  into  the  z-axis,  let  z^  be  a point  between 

z and  z where  K(v.,z)'  reaches  its  maximum  value  or 
o D 


9K(Vj,z) 

3z 


= 0 


(5) 


z = z . 
3 


From  Eqs-  (1)  or  (2)  we  note  that  variation  in  g(z)  around  z^ 

should  affect  the  values  of  I(Vj)  very  strongly  while  variation  in 

g(z)  at  values  of  z <<  z^  and  z >>  should  not  affect  I(Vj)  by 

the  same  magnitude.  We  propose,  therefore,  to  use  this  property 

to  map  this  point  v^  into  z^  on  the  z-axis.  In  general,  since 

K(v.,z)  reaches  its  maximum  values  at  different  values  of  z . , 

: 3 


j = 1,  2,  ...  J for  different  values  of  v ^ , j = 1,  2,  ...  J. 
can  use  Eq.  (5)  to  derive  a relationship  between  v and  z 


We 


V . ^ z . 

3 3 

and  map  different  points  on  the  v-axis  into  different  points  on 
the  z-axis. 


2.  The  I - g Relaxation  Transformation 

Mapping  of  the  I-axis  into  the  g-axis  is  much  more  difficult 
and  needs  to  be  carried  out  by  iteration.  We  apply  the  mean  value 
theorem  (or  the  method  of  steepest  descent)  to  Eq.  (1)  and  derive 
a relaxation  equation  of  the  form 


_ r (n  + 1)  , . n 

B[Vj,  g (Zj)] 

B[v.,  g^^hz.)] 


I(v.) 

J 


(6a) 
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Equation  (6)  is  written  in  an  iterative  form  where  g 
nth  guess  of  the  solution.  I 


(z . ) is  the 
(v.)  is  the  corresponding  value 


computed  according  to  Eq.  (1).  given  and  g 


(n  + 1) 


(-) 


is  the  resulting  (n  + 1)  guess.  Similarly,  the  relaxation  equa- 
tion corresponding  to  Eq.  (2)  is  of  the  form 


g^"^  (z.) 


I(V^) 

(V.) 


(6b) 


Details  of  this  derivation  are  given  by  the  author  in  Refs.  3 and 


4.  Equations  (6a, b)  transform  changes  in  the  ratio  I(Vj)/I 


(n) 


at  different  points  on  the  v-axis  into  changes  in  g^^^ (z^)  at 

specific  points  along  the  z-axis.  The  mechanism  of  this  trans- 

(n) 


formation  is  iterative  in  which  g 

. • n j T (n  + 1) 

n to  yield  a new  value  g (Zj) • 


(Zj)  is  modified  at  every  step 


It  is  possible  to  generalize  Eqs.  (6a, b)  formally  and  write 
the  relaxation  transformation  as 


(n  + 1)  , . (n)  (n)  , . 

g^  (Zj)  = g'  ' (Zj) 


(7) 


where  a . 

3 

or  (6b) . 


(n) 


are  scaling  factors  computed  directly  from  Eqs.  (6a) 


(It  is  sometimes  necessary  to  use  a weighted  form  of  the 
scaling  factors,  as  defined  in  Eq.  (10) , in  which  several  values 

. j",  are  used  to  derive  each  scaling  factor 
This  approach  will  be  discussed  later  in  this  section.) 


of  I (V  ) , j = j ' , 

_ (n)  ^ 
ot . 


J 

3. 


Iterative  Steps  of  the  Method  of  Solution 


Assume  that  a set  of  measurements  ^ j ) is  given  for  j = 
2,  ...  J and  use  Eq.  (5)  to  find  the  corresponding  values  of 

z , , ^ If  ^ f ...  J • 

(^ ) 

(a)  Make  an  initial  guess  (n  = 0)  for  g^  iz) . 


If 


ate  I 


(b) 

(n) 


Substitute  g^^^ (z)  into  Eq.  (1)  [or  Eq.  (2)]  and  evalu- 
(Vj)  j = If  2,  ...  J using  an  appropriate  interpolation 
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(quadrature ) formula . 

(c)  Check  the  residuals, 

, . i(v  ) - (V  ) 

ptn)  , V ^ L 3_ 

^ l(  .) 


j — 1,  2/  •••  J 


If  all  (Vj)  ->■  0,  then  is  a. solution.  If 


not,  then  obtain  a new  guess. 


(n  + 1) , . (n)  _(n) , . 

g (z . ) = a . T (z . ) 

3 3 3 


(d)  Go  to  step  (b)  and  repeat  until  step  (c)  is  satisfied. 


B.  Analytical  Example 

In  order  to  illustrate  the  relaxation  method  of  solution,  we 
suggest  the  following  analytical  example. 

Let 

I (v)  = g(z)  e ^ ^ dz 

J —00 

and  given 

- (z  - v)2 

I(v)  = e 

where  is  a constant,  find  g(z)  by  the  method  of  relaxation. 

(The  answer  is  a delta  function,  g(z)  = ^ (z^  - z) , which  we  will 
try  to  determine  next.) 

1.  The  v-z  Mapping  Transformation 

The  kernel  e has  a maximum  at  v = z and  thus  the 

corresponding  mapping  transformation  from  the  v-axis  to  the  z-axis 
is  V = z. 

2.  The  I - g Relaxation  Transformation 

The  corresponding  relaxation  equation  according  to  Eq.  (6b) 
is 
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(n  + 1)  . . (n)  . . I(v) 

g (z)  = g (z)  


(V) 


V = z 


3.  Iterative  Solution 


In  this  ideal  analytical  example  we  do  not  need  to  specify  an 
interpolation  formula  because  we  are  dealing  with  a continuous 
function  I (v)  which  maps  the  entire  v-axis  into  the  entire  z-axis. 

To  start  the  iteration  ( n = 0 ) let  us  assume  a constant 
value  for  the  initial  guess;  for  example 


and  apply  the  mapping  transformations  to  generate  the  following 
iterative  solutions.  We  get 


where  the  coefficients  C are  obtained  from  the  following  recur- 

n 

rence  relaxation 


C = 0 
o 


c = c + 

n n - 1 1 + C 


n - 1 


( n } 

To  show  that  the  recovered  solution  g ^ (z)  is  a delta 


function  centered  at  z^,  we  can  easily  prove  that, as  n ^ 


C oo 
n 


4-1-00 

j .oo 


dz  ^ 1 


and 

d 

n 
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where  is  the  width  of  the  function  (z)  at  half  peak  as 

shown  in  Fig.  1. 


Fig,  1,  The  analytical  illustration. 


C.  Stability  and  Convergence  of  Solutions 

The  previous  analytical  example  offers  an  ideal  case  in  which 
the  entire  v-axis  maps  into  the  entire  z-axis,  the  niimber  of  iter- 
ations that  can  be  carried  out  is  infinite, and  the  given  function 
I(v)  is  exact.  In  remote  sensing  problems,  however,  we  are  given 
small  (and  sometimes  incomplete)  data  sets  Kv^),  j = 1,  2 ...  J 
where  the  peaks  of  the  kernels  are  not  uniformly  distributed  and 
where  both  ICv^)  and  K(v.,  z)  are  known  with  a certain  degree  of 
uncertainty.  Under  these  conditions,  it  becomes  sometimes  neces- 
sary to  (i)  supplement  the  data  with  a priori  infoimiation  about 
the  expected  result,  (ii)  use  weighted  sealing  factors  a instead 
of  a in  order  to  minimize  the  effects  of  large  measurement  noise 
on  the  accuracy  and  stability  of  solutions,  and  (iii)  establish 
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ntimerical  criteria  for  determining  the  optimum  number  of  iterations 
required.  We  will  examine  these  aspects  of  the  problem  in  the 
remaining  part  of  this  section. 

D.  Properties  of  the  Residuals 

Application  of  the  relaxation  method  to  a variety  of  linear 
and  nonlinear  problems  has  shown  that  in  the  absence  of  noise  in 
the  given  data,  the  computed  values  of  the  function  approach 

the  measured  data  after  a small  number  of  iterations.  Their 

residual  difference  tends  asymptotically  to  zero  (or  to  the 

value  of  the  quadrature  errors) in  a root-mean-square  (rms)  sense  as 


But,  if  we  add  noise  to  the  data  as 
I* (Vj)  = I(v^)  + 

we  observe,  that  the  residuals,  in  this  case,  decrease  first  rapidly 
and  then  approach  an  asymptotic  value  equal  to  the  noise  in  the 
data  in  an  rms  sense  with 


(This  property  will  be  discussed  in  more  detail  in  the  following 
section  and  will  be  illustrated  in  Fig.  6.) 

Thus,  in  the  presence  of  random  errors  in  measurements,  the 
residuals  do  not  give  a false  indication  of  convergence  because 
they  will  not  tend  toward  zero.  The  residuals  first  decrease  and 
then  approach  an  asymptotic  value  of  the  same  order  of  magnitude 
as  the  errors  in  measurements.  This  property  is  due  to  the  partial 
overlapping  of  the  kernels  and  suggests  that  the  iterative  process 

/ ri  \ 

should  be  terminated  when  r'  approaches  its  asymptotic  value. 
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The  effect  of  continued  iterations  beyond  this  point  does  not 

always  increase  the  amount  of  information  extracted  from  the  data, 

and  in  the  presence  of  large  noise  levels  in  the  measurements 

might  lead  to  oscillations  in  the  recovered  solutions.  To  prevent 

such  oscillations  from  happening,  it  is  advisable  to  apply,  after 

a few  iterations,  say  n > 3,  certain  weights  to  the  scaling 

coefficients  a)  in  the  form  of 
D 


j = 1,  2 


J 


(10) 


and  use  "to  generate  the  new  iterations  as 

(n  + 1)  , , (n)  , . 

g (2  J = Uj  g (Zj) 


(11) 


The  use  of  will  slow  down  the  rate  of  convergence  considerably 
but  will  tend  to  diminish  the  effects  of  random  noise  in  the  data 
on  the  iterative  solution. 


Different  forms  of  (v^^)  can  be  adopted,  such  as  taking 

W.(v  ) equal  to  the  fractional  value  of  the  kernels  K(v,  z)  at 
3 ^ 


z . for  V 
3 1 


2 3 


In  this  case,  we  write 


(12) 


where  K(v  . z,  ) is  the  maximum  value  of  the  kernel.  Additional 
k k 

details  on  the  convergence  properties  of  the  solution  can  be  found 
in  Refs.  5,  6,  and  7. 


Ultimately,  the  iteration  process  should  be  terminated  when 
the  rate  of  convergence  of  the  solution  with  respect  to  itself, 
defined  as 


A (^) 

<Ag  > 


av 


' J = 1 


(n  + 1)  , . (n)  , . 

g (z  J - g (Zj) 


(13) 


approaches  a certain  prescribed  value. 
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E . Interpolation  Methods 

The  interpolation  aspects  of  the  solution  are  not  trivial, 
particularly  in  cases  where  the  number  of  measured  data  points  is 
deficient.  The  recovered  solution  is  a function  of  the  number  of 
useful  observations  available  and  of  the  interpolation  method  used- 
The  selection  of  a suitable  interpolation  formula  is  one  of  the 
subjective  aspects  of  this  problem.  While  it  is  always  possible 
to  apply  many  different  interpolation  methods,  it  is  obvious  that 
from  a finite  set  of  J measurements,  it  is  impossible  to  recover 
more  than  J independent  parameters. 

Application  of  the  relaxation  method  of  solution  to  a variety 
of  problems  has  shown  that  the  resulting  solutions  are  independent 
of  the  initial  guess  but  depend  on  the  interpolation  formula 
selected  to  determine  the  solution  g(z)  at  the  intermediate  values 
between  = 1,  2,  ...  J) . Linear  interpolation  methods  are 

recommended  in  the  absence  of  any  infoirmation  about  the  expected 
solutions;  however,  other  interpolation  methods  can  be  applied 
when  needed. 


In  cases  when  a priori  knowledge  of  the  shape  of  the  expected 

solution  is  given,  a perturbation  approach  is  recommended  to  make 

use  of  the  available  information.  This  can  be  accomplished  in  the 

iteration  process  by  including  in  the  initial  guess  all  available 

information  about  the  shape  of  the  expected  solution.  To  preserve 

this  shape  in  subsequent  iterations , we  perform  the  interpolation 

on  the  scaling  factors  (z.)  and  generate  scaling  factors  at 

all  intermediate  values  of  z,  af^l(z).  The  complete  interpolated 

int 

solution  is  then  obtained  at  all  values  of  z as 


(n  + 1)  , . (n)  , . (n)  , . 

g (z)  = a.  (z)  g (z) 

int 


(14) 


It  is  obvious  that  the  same  interpolation  procedure  can  be  per- 
formed also  when  is  used.  In  this  process,  the  final  answer 
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is  made  to  depend  on  the  initial  guess  by  preserving  the  form  of 
the  input  function  g^^^z)  in  all  the  steps  of  the  iterative 
solution. 

In  general,  however,  reliance  on  a priori  information  about 
the  expected  solution  should  not  be  considered  unless  the  number 
of  available  measurements  is  insufficient  and  unless  the  infor- 
mation content  of  measurable  data  is  incapable  of  recovering  cer- 
tain essential  features'  of  the  solution,  such  as  the  location  of 
the  tropopause.  However,  the  fundamental  mathematical  condition 
that  from  a set  of  J measurements  it  is  possible  to  recover  only 
J'  independent  parameters  such  that  J'  < J remains  the  rule. 

In  the  following  sections,  we  will  apply  the  relaxation 
method  to  two  remote  sounding  problems  and  study  the  properties  of 
the  solution  and  quality  of  results  specifically  for  the  deter- 
mination of  the  thermal  and  composition  structure  of  planetary 
atmospheres. 


III.  THE  INVERSE  PROBLEM  FOR  TEMPERATURE  PROFILES 


In  the  problem  of  remote  sounding  of  atmospheric  temperature 
profiles,  the  following  equation  occurs 


rz 


I(v)  = B['V,T(z^)  ]t(v,  z^)  + 


B[v,T(z) ]K(V,  z)dz 


(15) 


Equation  (15)  is  the  integral  form  of  the  radiative  transfer 

equation  for  a plane  parallel  homogeneous  and  nonscattering  atmos^ 

phere  in  local  thermodynamic  equilibrium.  I (v)  is  the  outgoing 

radiance  measured  at  a vertical  distance  z from  the  surface  z of 

o 

a planet  within  a narrow  solid  angle  around  the  local  vertical 
axis,  z.  B is  the  Planck  function  explicitly  given  as 

bv 

B[v,T(z)]  = av3/(e  - 1)  (16) 

where  a and  b are  two  given  constants.  t(v,  z)  is  the 
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transmittance  of  a coliunn  of  absorbers  between  levels  z and  z and 
is  defined  for  monochromatic  observations  as 


t(v,  z)  = exp 


p^(z')k(v. 


z')dz' 


(17) 


L . 2 J 

where  k(V/  z)  is  the  absorption  coefficient  at  v due  to  all  lines, 
and  can  be  represented  for  a Lorentz  profile  by  the  equation 


k(v,  p)  = ^ 
i 


S^(T) 

ir 


aj^(T,  p) 

(V  - v^) 2 (T,  p) 


(18) 


where  is  the  strength,  is  the  half  width  of  the  line  and 
is  the  frequency  at  peak  intensity  of  the  line.  The  density  pro- 
file of  the  absorbing  gas  s is  given  by  p (z*)-  The  kernel 

s 

K(v,  z,  p , ...)  is  defined  as 
s 


K(v,  z) 


3t(v,  z) 
3z 


(19) 


The  pressure  p(z)  is  related  to  z through  the  hydrostatic  equation 
dp  = - p(z)  g dz  (20) 

where  p (z)  is  the  density  profile  of  the  entire  atmosphere. 


In  practical  observations,  measurements  of  I (v)  are  made  at 
a discrete  number  of  frequencies  centered  within  a finite  band 
Av  with 


I(Vj) 

where  (v  . 


•V* 

= (Hv.,  v)I{v)dv 

J v"  ^ 

, v)  is  the  instrument  function. 


(21) 


From  a practical  point  of  view,  it  is  advisable  (but  not 
necessary  for  the  present  method)  to  substitute  Eq.  (15)  into 
Eq.  (21)  and  define  the  transmittance  T^(^j/  z)  in  the  interval  Av 
as 


r’ 

t(v  , z)  = , v)t(v.  , z)dv 

J II  ^ 9 


(22) 
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since  B(v^  T)  is  a smooth  function  in  the  interval  Av,  we  take 
B(V,  T)  = B(V^,  T) 

and  rewrite  Eq.  (21)  as 

rz 

I(V.)  = B[v.,  T(z  )]t(v.,  z^)  + B[v.,  T(z)]K(v.,  z)dz  (23) 

D J o 3 o j 3 

o 


In  the  rest  of  this  paper,  we  will  deal  with  Eq.  (23)  whether  we 
refer  to  the  measured  radiance  as  I(v^)  or  as  I(v). 

From  a given  set  of  values  of  Kv^),  j = 1,  2 ...  J,  we  want 
to  determine  the  temperature  profile  T(z.),  assuming  that  p (z) , 
t(v.,  z)  and  K(v.,  z)  are  known.  In  this  problem,  the  selection 
of  the  set  and  the  determination  of  T(z^)  are  strongly  related 
and  form  the  basis  for  the  method  of  inverse  solution  of  Eq.  (23) . 


By  selecting  a set  of  frequencies  with  varying  degrees  of 
atmospheric  attenuation  such  that  t(v  , z^)  > t(v.,  z) , ...  > 
x(Vj,  z) , we  can  generate  a set  of  kernels  K(v^,  z)  such  that  for 
each  value  of  the  kernel  possesses  a maximum  at  a different 
value  of  Zj  as  shown  in  the  illustration  (Fig.  2)  , 


A.  The  Mapping  Transformation 

Equation  (23)  is  a nonlinear  integral  equation  with  fixed 
limits  which  may  be  viewed  as  a nonlinear  transformation  from 
T(z)  to  I(v)  as  in  Eq.  (3),  namely, 

i(v)  =Nt(z) 

To  obtain  T(z) , we  need  to  perform  an  inverse  transformation  as 
in  Eq.  (4) , namely 

T(z)  = i(v) 

Figure  2 strongly  suggests  a mapping  transformation  from  the 
V axis  to  the  z axis.  Since  the  kernels  K(v.,  z)  are  strongly 
decaying  functions,  variations  of  T(z)  around  z^  will  affect  the 
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values  of  ^(^j)  very  strongly,  while  variations  of  T(z)  at  values 
of  z <<  Zj  and  z >>  z^  do  not  affect  l(v^)  appreciably.  Hence, 
we  propose  to  map  into  z^  where  z^  corresponds  to  the  peak 
value  of  the  kernel  K(v.,  z) . Mathematically,  we  derive  the  trans- 
formation 


V . 

3 


V(Zj) 


(24) 
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from  the  solution  of  the  equation 


8K(Vj,  z) 
3z 


= 0 


( j = 1,  2,  3 ...  j) 


(25) 


Equation  (25)  can  be  used  immediately  to  map  the  set  of  J points 
on  the  V axis  into  a set  of  J points  on  the  z axis. 

But  in  order  to  map  the  I axis  into  the  T axis,  we  need  a 
relationship  between  Kv^)  and  T(z^)  . The  author  (Refs.  3 and  4)  has 
applied  the  mean  value  theorem  to  Eq.  (9)  and  derived  the  following 
relaxation  equation 


B 


'v., 


+ 1) 


b|^v^,  (Zj)J 


I(V  J 


(V.) 


(26) 


Equation  (26)  relates  changes  in  the  outgoing  radiance  for  one  fre- 
quency V j with  changes  in  the  Planck  function  at  one  level  z ^ , 
as  illustrated  in  Fig.  2.  Equation  (26)  is  expressed  in.  an  itera- 
tive form  useful  for  our  purposes  where  (z)  and  t^^  (z) 

are  two  temperature  profiles  at  different  orders  n of  an  iterative 
solution.  I 


(V.)  is  the  radiance  computed  from  Eq.  (23)  for  a 


An) 


given  T (z),  and  I(v).  is  the  measured  radiance.  For  additional 
details  regarding  the  derivation  of  Eq.  (26) , see  Eqs.  (6)  to 
(9)  in  Ref.  4.  We  should  note  here  that  Eq.  (26)  can  be  derived 
also  by  applying  the  method  of  steepest  descent  as  indicated  in 
Ref.  2 by  the  author. 


B.  The  Iterative  Method  of  Solution 

We  proceed  to  solve  Eq.  (23)  for  the  determination  of  T(z)  by 
iteration  as  follows : 


Assume  a set  of  measured  radiances,  i(Vj)  is  given  for  j = 1, 


2,  ...  J. 


(n) 


1.  Make  an  initial  guess  (n  = 0)  for  T (z) . 

/ r»  \ 

2.  Substitute  t'  (z)  into  Eq.  (23)  and  evaluate  the 
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corresponding  (v^)  j = 1,  2,  J using  in  this  case  a linear 

interpolation  formula. 

3.  Check  the  residuals 


1 

rJ 

(V^)  - (Vj) 

2 - 

J 

= 1 

- 

1/2 


(27) 


for  each  frequency  and  in  an  rms  sense.  If  is  small,  then 

(z)  is  a solution.  If  R^^^is  not  small,  go  to  step  4. 

4.  Obtain  a new  guess 

T<"  * »(z.)  - 


where  the  scaling  factors  are  obtained  from  Eqs.  (16)  and  (26)  as 


bv  (z .) 

__J L 


(n) 

Ct  ' = " 

^ In  {l  - [1  - exp(bVj/T^^^  (Zj) ) ]l  (V  J/i  (v  J } (28) 

A second  criterion  to  establish  convergence  of  the  iterative 
solution  was  applied  here.  The  criterion  is  obtained  by  observing 
the  rate  of  convergence  of  the  temperature  profile  with  respect  to 
itself  as 


av  J j = 1 


T^^^z.)  - -'\z.) 


(29) 


The  iteration  is  terminated  when  <At^^^>  is  less  than  some  pre- 

av 

scribed  value,  say  0.1  K. 

5.  Go  to  step  2 and  repeat  until  step  3 or  Eq.  (29)  is 
satisfied. 


C.  Accuracy  of  the  Results 

The  relaxation  method  of  solution  has  been  applied  to  invert 
synthetic  radiance  data  generated  by  a computer  from  a set  of 

model  temperature  profiles  in  an  atmosphere  having  a constant  CO 

-6  ^ 
mixing  ratio  of  462  x 10  by  mass.  The  spectral  inteirval  selected 
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to  illustrate  this  study  corresponds  to  the  set  of  fre- 

quencies shown  in  Fig.  2. 

The.  instrumental  slit  function  (p(v.,  v)  is  taken  to  be 

triangular,  symmetrical  with  respect  to  v.,  and  having  a base  width 
-1  ^ 

equal  to  60  cm  - A typical  example  of  the  accuracy  of  the  recon- 
structed temperature  values  is  shown  in  Fig.  3. 

-Z.  Uniqueness  of  Solution 

The  typical  results  of  Fig.  3 show  two  interesting  properties 
which  call  for  comment.  First  of  all,  examination  of  the  solu- 
tions/ obtained  by  using  as  an  initial  guess  any  isothermal  pro- 
file or  the  U-S.  Standard  Atmosphere  temperature  profiles,  shows 
that  all  the  reconstructed  temperature  values  reproduced  very  well 
the  original  profile  in  less  than  seven  iterations  and  the  average 
absolute  error  in  the  reconstructed  temperature,  <AT>^^,  is  less 
than  0.1  K,  irrespective  of  the  initial  guess.  Secondly,  when  a 
small  perturbation  of  the  order  of  1 K was  superimposed  on  the 
exact  profile  and  the  resulting  profile  used  as  an  initial  guess, 
the  solution  converged  with  similar  rapidity.  The  value  of 

<AT>  decreased  from  1 K to  0.074  K in  one  iteration  only.  These 
av 

results  show  clearly  that  the  final  answers  do  not  depend  on  the 
initial  guess  and  that  convergence  is  guaranteed  for  large,  as 
well  as  small,  perturbation  solutions. 

We  have  also  observed  that  the  residuals  R^^^(v.)  of  the 

D 

individual  sounding  frequencies  do  not  converge  simultaneously  at 
the  same  order  n of  iteration.  The  reason  for  this  is  because 
the  absorption  properties  of  the  atmosphere  do  not  usually  allow 
for,  or  lead  to,  the  selection  of  a uniform  set  of  kernels 
K(v^,  z)  with  equal  half-widths  and  equally  spaced  peaks  z ^ . 

From  a numerical  point  of  view,  the  resulting  system  of  integral 
equations  is  poorly  discretized,  and  in  case  the  weighting 
functions  are  also  broad,  it  becomes  difficult  to  resolve  small 
details  in  the  profile  even  in  those  regions  where  the 
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Fig,  3,  Comparison  between  the  exact  temperature  profile  and 
the  reconstructed  temperature  values  for  synthetic  radiance  data 
starting  with  an  isothermal  initial  guess  and  using  linear  inter- 
polations. (From  Ref,  4.) 
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corresponding  peaks  are  narrowly  spaced.  We  note  here  that,  in 
general,  the  resolution  of  small  detail  in  only  one  region  of  the 
profile,  to  the  exclusion  of  the  rest,  is  not  always  possible 
regardless  of  the  accuracy  of  the  measured  data. 

It  is  possible,  however,  to  use  more  than  one  measurement  to 
recover  the  solution  at  one  point  z ^ . This  can  be  done  by  applying 
weighted  scaling  factors  as  in  Eg.  (11) 


3 3 3 


where  each  is  obtained  as  a weighted  average  of  more  than  one 

sounding  frequency  in  a manner  similar  to  Eq.  (10) . 


2.  Stability  of  Solutions 


The  present  relaxation  method  of  solution  is  a discrete 
numerical  process  in  which  convergence  is  judged  according  to  the 
extent  to  which  this  algorithm  suppresses  the  effects  of  quadrature, 
random  and  systematic  errors  on  the  final  temperature  profiles - 


a.  Quadrature  errors.  The  effect  of  quadrature  errors  on 
the  final  answer  depends  on  two  sources:  one  of  these  is  compu- 
tational, resulting  from  the  integrations  of  Eq.  (23)  for  the 
evaluation  of  ^ (^j) ; the  other  is  due  to  interpolations  resul- 
ting from  the  inability  of  a discrete  set  of  points  to  fit  the 
whole  temperature  profile  exactly  even  for  perfect  data.  In  the 
results  typified  by  Fig.  3,  a modified  Simpson *s  rule  was  used  to 
evaluate  Eq.  (23)  with  a first-order  interpolation  formula  for  the 
intemediate  values  of  temperature.  The  temperature  profile  in 
Fig.  3 is  relatively  smooth,  and  the  use  of  a different  inter- 
polation formula  or  a larger  number  of  sounding  frequencies  for 
such  profiles  is  not  warranted. 


b.  Random  errors.  The  question  of  the  propagation  of  random 
errors  is  a critical  one  and  depends  on  a number  of  factors, 
including  the  spectral  region  in  which  the  observations  are  made. 
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In  examining  the  tolerance  of  this  algorithm  to  random  errors,  the 
effect  of  their  distribution,  their  maximum  values,  and  their  rms 
values  will  be  taken  into  consideration. 

To  obtain  a feeling  for  the  stability  of  the  resolution, 
errors  were  produced  by  a uniform  random  error  generator  sub- 
routine; they  were  then  added  to  the  exact  synthetic  data,  and 
inversions  were  performed  for  a variety  of  cases  as  in  the  previous 
section.  In  the  results  shown  in  Fig.  4,  a set  of  10  random  errors 
having  a maximum  value  of  9.3%  and  a rms  value  of  4.8%  was  super- 
imposed on  the  exact  synthetic  data  of  Fig.  1.  The  reconstructed 
temperature  values  show  an  excellent  tolerance  to  random  errors. 

The  average  absolute  error  <AT>^^  in  the  temperature  here  is  1.5  K. 

Since  the  present  inversion  scheme  is  nonlinear,  the  effects 
of  random  errors  must  be  examined  for  each  case  separately-  The 
results  of  some  30  cases  studied  are  summarized  in  Fig.  5.  They 
show  that,  in  observations  made  in  the  4. 3 -ym  region,  a temperature 
accuracy  <AT>^^  of  1 K can  be  expected  with  a 2%  rms  random  error 
in  observations , and  an  accuracy  of  2 to  3 K can  be  expected  with  a 
5 to  7%  rms  error  in  observations. 


The  effects  of  random  errors  in  observations  vary  according 
to  the  frequency  and  can  be  estimated  in  principle  from  Eq.  (16) . 
We  consider  a hypothetical  set  of  sounding  frequencies  for  which 
the  weighting  functions  form  a perfect  set  of  delta  functions - 
The  dependence  of  errors  in  the  temperature  solution  on  random 
errors  can  be  obtained  by  differentiation  of  the  blackbody  func- 
tion with  respect  to  temperature  which  yields 


At 


T^d 


- bv/T 
‘ e 

bv 


) 


AB 

B 


(30) 


Equation  (30)  shows  that  a relative  error  AB/B  in  measuring  the 
blackbody  radiance  will  be  multiplied,  by  the  expression  between 
brackets,  when  the  radiance  is  inverted  to  temperature.  This  ideal 
multiplication  factor  is  a function  of  frequency.  A comparison  of 
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Fig.  5.  Effect  of  random  noise  in  the  data  on  the  accuracy 
of  the  recovered  temperature  profile  • (From  Ref.  4.) 


this  factor  with  the  slope  of  the  least-squares-f it  line  in  Fig.  5 
turns  out  to  be  very  satisfactory. 

Perhaps  more  significant  is  the  effect  of  random  errors  in 

measurements  on  the  behavior  of  the  residuals  . In  the  case 

3 

of  Fig.  3 for  zero  random  errors  in  observations,  we  recall  that 

the  solution  converged  after  a small  number  of  iterations.  The 

variations  of  the  corresponding  rms  value  of  the  residuals, 

<R^  '>  , with  respect  to  the  order  of  iteration,  is  shown  as  the 

rms 

lowest  curve  in  Fig.  6.  By  contrast,  the  uppermost  curve  corre- 
sponds to  the  case  of  Fig.  4 with  an  rms  random  error  of  4.8%. 

A closer  examination  of  the  various  results  shown  in  Fig.  6 
reveals  that  the  residuals  tend  toward  different  asymptotic  values 
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according  to  the  values  of  the  corresponding  random  errors  in 
observations.  For  zero  random  errors,  the  asymptotic  value  is 
equal  to  the  quadrature  errors. 

Thus,  in  the  presence  of  random  errors  in  measurements  the 
residuals  do  not  give  a false  indication  of  convergence;  they  will 
not  tend  toward  zero.  The  residuals  first  decrease  and  then 
approach  an  asymptotic  value  of  the  same  order  of  magnitude  as 
the  errors  in  measurements.  This  property  is  the  result  of  the 
partial  (nonlinear)  dependence  of  several  sounding  frequencies  on 
temperature  variations  at  one  pressure  level,  and  suggests  that 
the  iterative  process  should  be  terminated  when  becomes  equal 

to  the  value  of  rms  errors  in  measurements.  The  effect  of  con- 
tinued iterations  beyond  this  point  does  not  increase  the  amount 
of  information  extracted  from  the  radiance  observations;  it  sim- 
ply increases  the  rate  of  accumulation  of  errors  in  the  recon- 
structed temperature  values. 

The  solid  circles  in  Fig.  4 correspond  to  the  terminal  orders 
of  iteration  at  which  the  average  absolute  error  in  the  recon- 
structed temperature  values  <AT>  is  within  ±0.1  K of  the  mini- 

av 

mum.  The  solid  circles  occur  always  in  the  region  of  maximum 

( n ) 

curvature  of  the  variation  of  with  respect  to  n. 

c.  Systematic  errors.  Certain  transmittance  errors 
resulting  from  an  approximate  knowledge  of  the  composition  and 
the  spectral  properties  of  the  atmosphere  are  systematic  errors. 
The  effect  of  these  errors  on  the  behavior  of  the  residuals  is 
qualitatively  similar  to  the.  effect  of  random  errors;  that  is  to 
say,  the  larger  the  error  in  transmittance,  the  larger  the  corre- 
sponding asymptotic  value  of  the  residuals.  And,  inversely,  the 
residuals  tend  to  their  minimum  value  when  the  errors  in  trans- 
mittance are  minimum.  We  will  show  in  the  following  section  that 
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by  investigating  the  consequences  of  adopting  a criterion  which 

a*  ** 

we  shall  call  minimization  of  the  residuals,  we  can  develop  a 
satisfactory  method  for  the  determination  of  other  meteorological 
parameters,  such  as  the  constant  mixing  ratio  of  absorbing  gases. 

The  mapping  transformation  applied  in  this  section  can  be 
adapted  to  different  data  requirements  as  shown  by  Conrath  (Ref. 
8),  Smith  (Ref.  9),  Shaw,  et  al.  (Ref.  10),  and  Taylor  (Ref.  11). 
Figure  7 is  an  example  taken  from  Ref.  10  and  shows  a comparison 
between  the  temperature  profile  recovered  from  real  data  with 
colocated  radiosonde  and  rocketsonde  data.  More  recently,  the 
technique  has  been  applied  by  Jastrow  and  Halem  (Ref.  12)  to 
interpret  the  infrared  radiance  data  from  the  15  ym  sounding  on 
the  National  Oceanic  and  Atmospheric  Administration  (NOAA)  satel- 
lites for  numerical  weather  prediction  purposes. 

IV.  DETERMINATION  OF  COMPOSITION  PROFILES 

The  dependence  of  the  radiative  transfer  equation  on  the  con- 
centration of  absorbing  gases  Pg(z)  appears  in  the  kernel  of  the 
equation  as  shown  in  Eqs.  (17) , (19) , and  (23) . If  the  mixing 
ratio  profile  is  a constant  q^,  the  dependence  of  K(v,  z)  on  q^ 
is  simple  and  the  retrieval  of  q^  can  be  obtained  by  a minimiza- 
tion process.  However,  if  the  mixing  ratio  is  a function  of  z, 

the  unknown  p (z)  will  appear  as  a functional  in  the  integral 
s 

equation,  and  the  retrieval  of  p (z)  in  this  case  becomes  more 

s 

difficult.  We  will  examine  these  two  cases  in  this  section. 


A.  Case  of  the  Constant  Mixing  Ratio 

The  property  of  the  residuals  given  in  Eq.  (9)  and  Fig.  6 
has  been  used  in  Ref.  4 to  derive  the  constant  mixing  ratio  q of 
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Fig.  7-  Variations  of  the  rms  residuals  with  respect  to  the 
number  of  Iterations  for  different  noise  levels  In  the  radiance 
data,  (From  Ref,  4.) 

absorbing  gases,  such  as  the  constant  mixing  ratio  of  carbon 
dioxide  in  the  terrestrial  atmosphere.  Here,  an  error  in  the 
value  of  the  mixing  ratio  used  in  computing  the  kernel  intro- 

duces  an  error  in  P)  which  will  prevent  the  residuals  from 

converging  to  near  zero.  The  residuals  will  reach  an  absolute 
minimum  value,  only  when  the  correct  kernel  is  used,  i.e.,  when 
the  correct  mixing  ratio  is  known,  assuming  all  other  sources  of 
error  to  be  relatively  small. 
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As  an  illustration,  we  applied  this  method  to  the  synthetic 
data  of  Fig.  3,  assuming,  however,  the  value  of  q^Q  (which  is 
equal  to  462  x 10  ) to  be  unknown.  The  results  in  Fig.  8 of  the 

twelfth  iteration  clearly  show  that  the  residuals  have  one  minimimi 
at  the  correct  value  of  the  mixing  ratio.  The  results  of  the 
third  iteration,  for  which  T(z)  is  far  from  having  converged,  show 
that  a good  approximation  to  the  value  of  the  mixing  ratio  can.be 
obtained  with  just  a rough  knowledge  of  the  temperature  profile. 


B.  Case  of  the  Variable  Mixing  Ratio 

The  determination  of  the  composition  profile  Pg(z)  can  be 
obtained  by  applying  a mapping  transformation  similar  to  the  one 
used  for  T(p).  However,  the  relaxation  equation  required  to  trans- 
form the  I axis  into  the  p axis  may,  in  some  cases,  be  hard  to 

s 

express  analytically.  According  to  Eq.  (7) , the  relaxation 
approach  can  be  generalized  to  solve  for  any  function  or  functional 
under  the  sign  of  integration.  If  g(Zj)  is  the  temperature  pro- 
file then  a.  can  be  obtained  directly  from  Eq.  (26).  But  in  the 
case  of  the  composition  profile,  the  deteimination  of  a . is  more 

difficult  because  p (z)  appears  as  a functional  in  the  kernel, 

s 


K(Vj,  Zf  "'^p^(z)^,  ...) 


9t(v.,  z,  <p  (z)>,  — ) 
_ J s 

9z 


since  x and  K depend  on  the  distribution  of  Pg(z)  between  z and  z. 

The  notation  <p  (z)>  indicates  that  the  transmittance  and  the  ker- 
s 

nel  are  functionals  of  p (z) . 

I.  The  General  Approach 

To  determine  p^(z),  let  us  first  integrate  Eq.  (23)  by  parts 
and  write  the  result  as  in  Ref.  5 


rz 


I(Vj)  = B[Vj,  T(z) ] 


t[v  , z,  <p  (z)>,  ...]  dz  (31) 
3 s dz 
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Fig.  8.  Determination  of  the  constant  mixing  ratio  of  an 
absorbing  gas  by  the  criterion  of  minimization  of  the  residuals. 
(From  Ref.  4.) 


To  determine  p (z.)  from  a given  set  of  radiance  measurements, 
s J 

assuming  that  the  temperature  profile  is  known  and  is  not  iso- 
thermal, we  map  the  v.  axis  into  the  z.  axis  according  to  Eg.  (5) 

3 ( n)  ^ 

then  make  an  initial  guess  \z  .)  and  solve  the  equation 

® 3 


I (v J - B[v^ , T(z) ] = 


rz 


•'z 


t[v.,  z,  <a P (z  . ) > , ...]  dz  (32) 
D j s 3 dz 


to  obtain  a set  of  scaling  factors  ^ for  j = 1,  2 . . . j.  We 

generate  the  next  iteration  through  the  relaxation  equation 


(n  + 1)  , - (n)  ^ (n)  , . 

p (z .)  = a.  p (z. ) 

s 3 3 s X 


(33) 


This  iteration  process  is  repeated  until  each  value  of  the  scaling 
constants  approaches  unity,  which  is  equivalent  to  satisfying  the 
residuals  in  Eq.  (9) . This  relaxation  method  of  solution  leads  to 
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accurate  determination  of  composition  profiles  without  any  a priori 
infomation  for  the  expected  solution,  as  shown  in  Fig,  9. 

The  relaxation  method  can  be  applied  in  conjunction  with  any 
interpolation  formula.  The  extent  of  interpolation  is  dictated 
by  the  quadrature  requirements,  and  by  the  need  to  optimize  the 
quality  of  solutions  obtained  from  a finite  set  of.  sounding  fre- 
quencies. Additional  details  on  this  subject  can  be  found  in 
section  2 of  Ref.  5 and  in  section  4 of  Ref.  6. 


2.  Approximate  Relaxation  Equation 


The  determination  of  can  be  rather  time  consuming  because 

it  requires  reevaluation  of  x many  times.  Two  approximate  relax- 
ation equations  have  been  derived  in  Eqs.  (13)  and  (17)  of  Ref.  5. 
We  give  here  one  approximation  to  the  relaxation  equation 


a . 
3 


(n) 


= 1 - 


i(v^)  - (v^) 


fZ 


> 0 


(n)  , (n)  8B  , 

T In  T T—  dz 
oZ 


(34) 


where  x and  B are  functions  of  both  v and  z.  Equation  (34)  proved 
to  be  very  useful  particularly  when  a is  close  to  unity. 


V.  REMOTE  SOUNDING  IN  THE  PRESENCE  OF  CLOUDS 

Equation  (23)  is  derived  for  the  case  of  plane,  parallel, 
homogeneous  and  nonscattering  atmospheres.  This  is  an  ideal  case 
which  does  not  usually  apply  to  observations  in  the  presence  of 
clouds  or  other  horizontal  inhomogeneities.  In  this  section,  we 
treat  the  problem  of  remote  sounding  of  cloudy  atmospheres  for  the 
determination  of  the  "clear  column"  vertical  profiles,  i.e.,  the 
vertical  temperature  profiles  in  the  clear  portions  of  the  fields 
of  view.  We  will  separate  the  problem  into  two  parts.  The  first 
part  deals  with  the  simple  case  of  a single  cloud  layer  or  a single 
degree  of  horizontal  inhomogeneity  and  the  second  part  deals  with 
the  general  case  of  multiple  cloud  layers.  The  treatment  of  this 
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Fig.  9.  Determination  of  the  variable  water  vapor  profile 
from  synthetic  data  with  random  noise.  (From  Ref.  5.) 


problem  in  this  section  will  be  brief  and  will  describe  only  the 
results  which  have  been  published  in  Refs.  13  to  16.  Additional 
work  on  this  problem  is  still  continuing. 

A.  Observations  in  the  Presence  of  a Single  Cloud  Layer 

In  this  part/  we  treat  two  cases  of  clouds-  The  first  case 
is  general  and  requires  no  a priori  knowledge  of  the  radiative 
properties  of  the  clouds.  The  second  case  is  for  clouds  with 
known  properties/  such  as  black  or  gray  clouds. 

i.  Clouds  with  Spectrally  Unknown  Characteristics 

We  consider  two  adjacent  fields  of  view  having  different 

fractional  cloud  covers  at  the  same  height  z as  shown  in  Fig.  10. 

c 

We  express  the  outgoing  radiance  (v)  and  l2(v)  from  the  first 
and  second  fields  of  view  as 

i,(v)  = •••) 

JjCv)  - HjGCv,  z^,  t,  r,  e,  ...) 
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'CLEAR^*'^  -Tj(i')  = Nj  G (z/,  Z,  t,  e, /»,  . .) 

= N^Gd/,  1,\,Q,P,..) 

Fig.  10.  The  single  cloud  layer. 

Equation  (35)  makes  no  assumption  about  the  radiative  transfer 

properties  of  clouds;  it  simply  states  that  the  observed  radiance 

ij^(v)  is  equal  to  the  clear  column  radiance  which  would  have  been 

measured  in  the  absence  of  clouds  minus  the  radiance  G "obscured" 

by  the  presence  of  a fractional  cloud  cover  N^.  G is  unknown  and 

depends  onv,  z , and  the  spectral  properties  of  the  clouds, 
c 

If  the  two  fields  of  view  are  small  and  contiguous,  we  can 
assume  that 

i(v)  = It (V)  , = Ip(v)  , 

J clear  [ ^ J clear 

and  substitute  into  Eq.  (23)  to  eliminate  G and  get 

i(v)  = i^(v)  + n [ijCv)  - i2(v)]  (36) 

where  n = unknown  = — “ 

Thus,  if  n is  known,  we  can  reconstruct  the  clear  column  radiance 
according  to  Eq.  (36)  for  all  frequencies  and  proceed  to  recover 
T(z)  as  described  earlier. 
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According  to  Eq.  (36)  , ti  can  be  determined  from  a knowledge 
of  I(v)  at  any  frequency , say  v* , as 

I(v')  - i^(V) 

" Ii(v')  - i^(v') 

However,  the  exact  value  of  the  clear  column  radiance  I(v’)  is 
actually  unknown  because  I(v')  itself  depends  on  T(z) . Thus,  the 
determination  of  n and  T(z)  should  be  carried  out  simultaneously 
and  the  selection  of  v*  should  satisfy  certain  convergence  cri- 
terion in  order  to  ensure  uniqueness  of  the  solution. 


The  argument  for  the  selection  of  v'  is  as  follows:  if 

T^  ^ (z)  is  any  guess  on  the  solution  and  I (v’)  is  the  corre- 
sponding radiance  according  to  Eq.  (23),  we  get  from  Eq..  (37). 


(V-)  - (V-) 

ii(v')  - i2(v') 


(38) 


and  by  adding  and  subtracting  I(v)  from  the  numerator  we  get 
(n) 


n 


= n + 


(V')  - I(v') 


I^(v)  - I^(v) 


= n + ~ 


ii  (V)  - 


= n + 6^"^  (n) 


Now,  in  order  to  minimize  6'  ^ (ri)  , Eqs.  (26)  and  (16)  require 
that  v'  < Vj  where  is  the  set  of  temperature  sounding  frequen- 
cies. By  simple  differentiation  of  the  Planck  function  with 

respect  to  T and  by  substitution  into  Eq.  (26) , we  can  show  that 
(n) 

6^  ^ (n)  is  directly  proportional  to  v for  a given  temperature 

( n ) 

error  At  = T(z)  - t^  ^ (z)  , as 


(V')  _ AB(v'  , T) 
I(v')  ~ B(V'  , T) 


At  the  same  time,  the  frequency  range  v*  should  be  cloud 
dependent  so  that  i^(v')  ^ I^(v')- 

We  can  translate  this  into  a condition  for  convergence  and 
select  V * to  ensure  that 
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Application  of  this  method  to  recover  T(z)  in  the  presence  of 
clouds  has  shown  that  the  solution  exhibits  the  same  characteristics 
as  in  the  case  of  clear  fields  of  view-  See  Ref.  6 for  additional 
details. 

2-  The  Single  Lager  of  Black  Clouds 

For  the  case  of  black  clouds,  the  term  G(v,  z , *..)  in 

c 

Eq.  (35)  can  be  expressed  analytically  as 
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(42) 


G(v,  z ) = - B[v,  T(z  )]t(v,  z ) + B[v,  T(z  )]t(v,  z ) 
c c c o o 


rz 


B[v,  T(z)]K(v,  z)  dz 


where  z^  is  the  cloud-top  height.  By  substituting  Eq.  (42)  into 
Eq.  (35) , and  using  Eq.  (23)  , we  can  express  the  measured  radiance 
in  any  field  of  view  an  integral  function  of  the  clear 

column  temperature  profile  T(z) , the  cloud-top  height  z^  and  the 
fractional  cloud  cover  N/  according  to  Eq.  (1)  in  Ref.  13,  as 

i(v)  = N { B[v,  T(z  )]t(v,  z ) 

c c 


+ 


|*z 

^ B[V,  T(z)]K(v, 
c 


z) 


dz  } 


+ (1  - N) 


+ 


{ B[v,  T(z^) ]t(v,  z^) 
T (z) ]K(v,  z)  dz  } 


(43) 


j{n) 

INITIAL ► 

GUESS 


Fig.  11.  Flow  diagram  of  the  iteration  method  of  solution 
for  the  determination  of  the  dear-column  temperature  profiles. 
(From  Ref.  14.) 
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If  N and  z are  known,  it  is  obvious  that  the  problem  reduces  to 
c 

the  case  described  earlier  in  this  paper.  In  practice,  however, 

neither  z nor  N are  known,  and  a method  of  solution  should  be 
c 

derived. to  account  for  (or  eliminate)  the  effects  of  clouds.  When 
we  solve  Eq.  (23)  for  measurements  made  in  the  presence  of  clouds 
(i.e.,  this  is  equivalent  to  assuming  N = 0 in  Eq.  (43)),  we  obtain 
an  apparent  temperature  profile  T^(z)  which  is  different  from  the 
true  dear-column  profile  T(z)  . The  author  (Ref.  13)  derived  a 
simple  relationship  between  T(z)  and  T(z)  in  the  form  of 

B[v.,  T(z.)]  = B[v.,  T(z.)]  z.  > z ) 

3 3 3 1 DC  / 

B[Vj,  T(Zj) ] = B[v^,  T(Zj) ] y 

+ a{B[v.,  T(z.)]  - B[v.,  T(z  )]}  ( z . < zu  (44) 

D D DC  ^ 3 <^J) 

where  a = N/(l  - N)  and  B is  the  pianck  function  given  in  Eq.  (16). 

Let  us  consider  next  two  adjacent  fields  of  view  having  dif- 
ferent amounts  of  clouds,  and  / at  the  same  height  z . We 

1 ^ c 

can  eliminate  B[v.,  T(z  )]  from  Eq.  (44)  and  write  (see  Eq.  (12) 

D c 

in  Ref.  13) 

B[v^,  T(Zj) ] = B[Vj,  Tj (Zj) ] 

+ n{B[v^,  T^(Zj)]  - B[Vy  T^CZj)]}  (45) 

The  practical  benefits  of  Eq.  (45)  are  obvious.  The  clear  column 
temperature  profile  T(z)  can  now  be  recovered  directly  by  a simple 
transformation  of  the  apparent  temperature  profiles  (z)  and  T^(z) 
of  two  adjacent  fields  of  view.  n is  an  unknown  constant  which  can 
be  deteimined  from  an  additional  frequency  as  described  earlier  in 
this  section. 

3.  Determination  of  the  Amount  and  Height  of  Clouds 

The  value  of  the  fractional  cloud  cover  N and  the  height  z of 

c 

clouds  can  be  detennined  when  the  radiative  transfer  properties  of 
the  clouds  in  G(v,  z^,  t,  r,  e,  ...)  are  given. 
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For  the  case  of  black  clouds,  the  author  (Ref.  4)  showed  that 
by  substituting  Eq.  (43)  into  Eq.  (35)  and  by  taking  the  ratio  for 
two  different  sounding  frequences  and  v^,  we  get 

- i.(Vi)  Z ) 

-T-T = -7-T ^ (46) 

(V2)  - Ik(V2>  G (V2,  z^) 

where  Eq.  (46)  is  now  a function  of  one  unknown  z , assuming  that 
. . c 

T'  ' (z)  has  been  determined  as  described  earlier.  The  solution  for 
z^  can  be  obtained  by  minimization  techniques  as  described  in 
section  6 of  Ref.  6.  The  corresponding  value  of  is  obtained 
directly  from  Eq.  (35)  as 


G(v,  z ) 
c 


(47) 


Applications  of  this  approach  to  the  infrared  data  from  the  Vertical 
Temperature  Profile  Radiometer  (VTPR)  sounder  on  NOAA  4 are  illus- 
trated in  Fig.  12. 


B.  Observations  in  the  Presence  of  Multiple  Cloud  Foimations 

The  two  methods  described  earlier  for  the  elimination  of  the 
effects  of  a single  layer  of  clouds  can  be  extended  to  multiple 
cloud  formations  for  the  cases  of  clouds  with  spectrally  unknown 
characteristics  and  for  black  clouds.  Derivation  of  the  required 
equations  can  be  found  in  Ref.  14.  A brief  summary  of  the  final 
results  will  be  given  next. 

i.  Clouds  with  Spectrally  Unknown  Characteristics 

Let  us  reexamine  Eq.  (35)  now  and  look  at  NG,  on  the  right- 
hand  side  of  the  equation,  as  a one  term  expansion  of  the  dif- 
ference between  the  clear  column  radiance  I(v)  and  the  radiance 
measured  in  the  presence  of  clouds  I(v).  In  the  case  of  Eq.  (35) , 
N is  just  a coefficient  of  expansion  and  G(v,  z,  t,  r,  e , ...)  is 
the  expansion  function.  The  mathematical  form  of  G need  not  be 
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Fig.  12.  Meridional  profiles  of  zonally  averaged  cloudiness 
from  the  VTPR  sounder  on  the  NOAA  4 satellite , for  the  period  of 
January  1-7,  1975. 
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defined  because  it  will  be  eliminated  by  using  measurements  from 
adjacent  fields  of  view  as  shown  in  Eq.  (35)  . A knowledge  of  the 
fonn  of  the  function  G is  necessary,  however,  if  we  need  to  deter- 
mine the  amount  of  the  corresponding  fractional  cloud  covers. 

The  one-term  expansion  of  Eq.  (35)  may  not  be  sufficient  in 
the  presence  of  multiple  cloud  formations.  Therefore,  we  propose 
to  use  a three-term  expansion  of  the  form 

(V)  - I (V)  = .n;  g'  + < g"  + <■  G"’  (48) 

k k k k 

with  G*  = G*  (v,  z,  t*  , r*,  e*,  ...)  and  similarly  for  G”  and  G*”  . 

The  expansion  functions  G depend  on  v and  z as  well  as  on  the  cloud 

transmissivity  t,  reflectivity  r and  emissivity  e.  N',  N'*  and  N*" 

are  the  expansion  coefficients.  The  next  step  now  is  to  eliminate 

G from  Eq.  (48)  and  express  I(v)  as  a function 

and  N"'  . 
k 

In  order  to  eliminate  the  expansion  functions,  we  consider 

observations  over  four  adjacent  fields  of  view,  K = 1,  2,  3,  and  4, 

and  use  the  first  three  equations  to  express  G*,  G**  and  G"*  as 

functions  of  the  N,  , I (v)  and  I,  (v) . We  substitute  the  results 
k k 

into  Eq.  (48)  for  k = 4 and  write 

I(v)  = i,(v)  *ll,  ^ n,[  i,(v)  - ^ ^(v)]  (49) 

Details  of  the  substitution  are  given  in  Appendix  A of  Ref.  15. 
As  expected,  we  note  that  the  first  term,  = 1 , of  Eq.  (49)  corre- 
sponds to  the  case  of  a single  cloud  layer  as  given  in  Eq.  (36) . 

The  determination  of  T(z)  and  is  carried  out  simultaneously 
by  iterations  according  to  the  steps  given  in  Fig.  11.  A typical 
illustration  of  the  accuracy  of  this  method  is  shown  in  Fig.  13, 
using  synthetic  radiance  data.  Additional  details  on  the  accuracy 
and  stability  of  the  solution  are  given  in  Ref.  14.  Experimental 
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Fig.  13.  Determination  of  the  dear-column  temperature  pro- 
file in  the  presence  of  three  cloud  layers^  (From  Ref.  14.) 

verification  of  the  expansion  approach  has  been  obtained  and  dis- 
cussed in  Refs.  15  and  16. 

We  conclude  the  discussion  here  by  indicating  that  the  use  of 
more  than  four  expansion  terms  is  feasible  from  a mathematical 
point  of  view,  but  from  an  experimental  point  of  view  the  expansion 
may  not  converge  always  because  of  the  effects  of  noise  in  the 
data  and  uncertainties  in  the  accuracy  of  the  computed  kernels. 

2-  Multiple  Layers  of  Black  Clouds 

In  the  case  of  black  clouds,  Eq.  (45)  can  be  extended  to  read 

B[v^,  T(Zj) ] = B[v^,  (Zj) ] 

+ f T^(Zj)]  - B[Vj,  ^(Zj)]}  (50) 

^ “ 1 
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where  T,  (z.)  is  the  apparent  temperature  profile,  over  the  kth 
3 

field  of  view  (obtained  without  accounting  for  the  effects  of 
clouds) . The  coefficients  ri^  should  be  determined  from  additional 
radiance  data  measured  at  different  frequencies  over  the  same  k 
fields  of  view  or  from  a priori  knowledge  of  certain  properties  of 
the  solution,  such  as  the  value  of  T(z)  or  the  lapse  rate  dT/dz 
at  given  heights  z. 


VI.  APPLICATIONS  AND  EXTENSIONS 

The  mapping  transformation  method  described  in  this  paper  has 
been  presented  specifically  as  a method  of  solution  of  the  non- 
linear radiative  transfer  equation.  However,  it  must  be  obvious 
that  the  mapping  transformation  is  general  and  can  be  applied  to  a 
wide  class  of  nonlinear  as  well  as  linear  integral  equations,  as 
in  the  case  of  the  analytical  example.  The  only  requirement  is 
for  the  kernel  K(v,  z)  to  be  a rapidly  decaying  function  with 
maxima  occurring  at  different  values  of  z for  different  values  of 

V- 

Extensions  of  this  method  and  applications  to  other  linear 
and  nonlinear  problems  can  be  found  in  the  works  of  Barcilon 
(Ref.  7),  Menzies,  et  al . (Ref.  17),  Twomey,  et  al . (Refs,  18  and 
19),  Twitty  (Ref.  20),  Grassl  (Ref.  21),  Gautier,  et  al - (Ref,  22), 
Encrenaz,  et  al . (Ref.  23),  and  Gille,  et  al,  (Ref.  24). 


SYMBOLS 

a constant  defined  in  Eq.  (16) 

b constant  defined  in  Eq.  (16) 

B Planck  function 

c subscript  denoting  clouds 

E convergence  criterion 

g function  to  be  determined 

G radiance  from  cloudy  portion  of  fields  of  view 
I radiance  function 
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I outgoing  radiance  measurement 

k absorption  coefficient 

K kernel  of  integral  equation 

n number  of  iterations 

N fractional  cloud  covers 

N integral  operator 

p Pressure 

R residual  function  defined  in  text 

s subscript  denoting  absorbing  gas 

T temperature 

T apparent  temperature  profile 

z geopotential  height 

z,z^  lower  and  upper  limits  of  the  range  of  integration 

W relaxation  weight 

a scaling  factor  Eq.  (7) 

a weighted  scaling  factor  Eq.  (10) 

a*  line  half  width 

e random  error 

r|  cloud  coefficients 

V frequency,  cm  ^ 

p air  density 

T atmospheric  transmittance 

(p  instrument  function 
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DISCUSSION 


Rodgers:  Can  you  produce  an  estimate  of  the  error  in  your  final 

solution  using  the  relaxation  method?  I mean  the  total  error. 

Chahine:  This  is  a nonlinear  equation  and  it  is  very  difficult  to 
determine  explicitly  the  effects  of  noise  in  the  data  on  the 
accuracy  of  the  final  solution.  By  trying  several  cases  of  noise 
levels,  it  becomes  possible  to  describe  the  growth  of  errors  in 
the  solution  as  a function  of  noise  in  the  data.  Professor 
Barcilone^  has  shown  that  the  growth  of  errors  in  the  final  solu- 
tion is  cyclical.  In  other  words,  he  has  shown  that  the  growth 
of  errors  in  the  final  solution  does  not  vary  linearly  with  the 
growth  of  noise  in  the  given  function. 

Rodgers:  This  isn't  quite  what  I meant.  I meant  the  departure  of 

your  solution  from  the  real  profile.  I didn't  mean  the  sensi-* 
tivity  of  your  solution  on  noise  in  the  measurement. 

Chahine:  That  depends  on  the  degree  of  discretization  of  the  given 

function  or  in  other  words  on  the  information  content  of  the  given 
data.  It  depends  also  on  any  a priori  information  available  about 
the  expected  solution. 

Gille:  When  you  iterate  and  drive  the  radiance  residuals  down, 

they  drop  monotonically . Does  the  error  of  your  temperature 
solution  also  drop  monotonically? 

Chahine:  During  the  first  few  iterations,  the  errors  in  the  solu- 

tion decrease  monotonically  with  the  radiance  residuals.  However, 
when  the  decrease  in  the  radiance  residuals  becomes  slow^ 

the  accuracy  of  the  solution  becomes  dependent  not  only  on  the 
but  also  on  its  first  and  second  derivatives.  Thus,  at  higher 
iterations,  beyond  the  point  of  maximum  curvature  in  the  Figure, 
the  errors  in  the  solution  do  not  necessarily  decrease  monotonically 
with  r(^) ; and  very  often  they  don't. 

Fleming:  I would  like  to  take  exception  to  your  statement  that 

your  solution  does  not  depend  on  the  initial  profile.  If  the 
Chairman  will  permit  me,  I have  a slide  to  illustrate  my  point. 


^See  Ref.  7 in  this  paper. 

2 . . . 

Pointing  to  Fig.  7. 
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DEPENDENCE  OF  THE  CHAHINE  SOLUTION  ON  THE  INITIAL  PROFILE 


This  is  a VTPR  simulation  study  in  which  we  have  107  retrievals  by 
the  Chahine  method  on  the  right-hand  side.  The  arrows  show  the 
locations  of  the  peaks  of  weighting  functions  and  the  dashed  line 
is  the  RMS  error  of  the  solutions  when  forecasts  were  used  for  the 
initial  approximation.  The  solid  line  is  the  RMS  error  of  the 
solutions  when  climatology  is  used  for  the  initial  approximation. 
Notice  that  there  is  a definite  dependence  of  the  solution  on  what 
is  used  as  the  initial  approximation.  On  the  left  is  the  same 
situation,  except  the  retrieval  method  is  the  Rodgers-Strand- 
Westwater  statistical  solution.  Clearly,  the  Chahine  method  shows 
a dependence  on  the  first  guess  virtually  of  the  same  magnitude  as 
the  Rodgers-Strand-Westwater  method. 

Chahine:  I agree  with  you  here  because  the  slide  I presented  on 

the  VTPR  showed  very  strong  dependence  on  the  initial  guess.  I 
agree  with  your  results. 

Fleming:  But  you  just  said  and  your  abstract  said  it  is  not 

dependent  on  the  initial  guess. 
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Chahine:  That  depends  on  the  properties  of  the  kernel  and  the 

given  functions.  In  my  abstract  I was  not  dealing  specifically 
with  the  VTPR  data.  I said  from  a complete  set  of  data  you  should 
be  able  to  get  a complete  solution  independent  of  the  initial  guess. 
But  for  the  case  of  the  VTPR,  the  given  data  are  incomplete  and, 
therefore,  the  solution  is  incomplete  and  depends  on  the  initial 
guess.  For  the  specific  result  you  have  shown  on  the  VTPR  I agree 
with  your  conclusion. 

Susskind:  I would  like  to  make  a comment  on  the  past  two  comments. 

When  we  talk  about  the  dependence  of  the  initial  guess,  there  are 
two  different  things  which  would  cause  dependence  on  the  initial 
guess  that  Dr.  Chahine  was  just  referring  to.  First,  if  we  think 
about  the  effects  of  clouds  and  the  ability  to  retrieve  surface 
temperatures,  if  you  only  have  VTPR  soundings  and  you  want  to 
filter  out  the  effects  of  clouds,  then  you  have  to  return  the  guess 
at  the  surface,  because  there  is  sufficient  information  in  the 
observations  to  constinjct  the  clear  column  radiances.  That's  one 
dependence  of  the  guess.  You  get.  a surface  temperature  that  looks 
like  the  guess,  then  that's  one  kind  of  problem.  Even  if  you  have 
the  clear  column  radiance  and  you. don't  have  to  worry  about  the 
clouds,  the  shape  of  the  profile  to  some  extent  is  what  is  impor- 
tant. You  have  information  at,  say,  the  peaks  of  the  weighting 
functions  of  the  channel,  that  much  information,  and  you  have  to 
make  some  assumptions  as  to  what's  happening  in  between  those 
points.  If  the  shape  of  the  initial  guess  is  wrong,  the  shape  of 
the  solution  will  be  wrong.  The  fine  vertical  structure  of  the 
solution  has  to  follow  the  guess.  If  the  points  come  closer  and 
closer  together  so  that  you  have  more  information,  then  you  could 
say  to  better  approximation  that  the  solution  does  become  inde- 
pendent of  the  guess  because  you're  talking  about  errors  in  shape 
between  points  that  are  very  close  together  and  it's  not — well, 
it's  essentially  linear  in  there.  It  doesn't  matter  what's  hap- 
pening. So  there  are  two  different  things  that  cause  dependence 
on  the  guess. 

Staelin:  Does  your  method  apply  to  certain  other  situations?  You 

began  by  saying  you  might  associate  each  frequency  with  that  alti- 
tude where  the  weighting  function  peaks.  There  are  certain  problems 
where  all  the  weighting  functions  peak  at  the  same  altitude,  how- 
ever. For  example,  if  you  are  attempting  to  sound  temperature  by 
looking  up  from  the  ground  every  weighting  function  peaks  at  the 
surface.  Other  problems  involve  weighting  functions  that  have  two 
peaks.  Does  your  method  apply  to  these  cases? 

Chahine:  Yes  it  does,  but  the  results  are  not  as  satisfactory  as 

for  the  case  where  the  weighting  function  has  distinct  peaks  at 
different  values  of  the  physical  axis.  If  you  cannot  discriminate 
between  the  information  received  from  different  heights,  you  can- 
not have  a satisfactory  inverse  solution.  Professor  Twomey  has 


114 


introduced  a modification  to  the  relaxation  equation  which  applies 
to  the  case  you  have  just  described- ^ 

Twomey:  You  showed  one  of  the  last  graphs  and  there  was  the 

residual  decreasing  to  the  error  level  and  then  flattening  out. 

I have  seen  this  same  behavior  in  algorithms,  and  it*s  kind  of 
spookey,  really.  The  thing  I want  to  ask  though  is  the  following: 
This  seems  to  hold  with  random  errors  and  I have  the  feeling  that 
there  must  be  some  kind  of  error  that  would  be  disastrous  which 
the  thing  would  try  to  invert.  Have  you  found  out  anything  about 
the  character  of  that  kind  of  error? 

Chahine:  Yes.  But  first,  regarding  the  property  of  the  residuals 

shown  here^ , I thought  at  first  when  I noticed  that  the  residuals 
did  not  decay  down  to  zero  in  the  presence  of  noise  in  the  data, 
this  behavior  is  a property  of  the  nonlinear  equations  I was 
studying.  However,  in  examining  the  residuals  obtained  by  Dr. 
Twitty^  for  his  linear  equation,  I found  out  that  the  residuals 
in  his  case  also  decreased  rapidly  at  first  and  then  approached  an 
asymptotic  value  nearly  equal  to  the  errors  in  the  data.  Because 
of  this,  I now  believe  that  this  behavior  of  the  residuals  is  due 
to  the  types  of  overlapping  kernels  under  consideration.  Now  when 
does  this  property  of  the  residuals  break  down?  Usually  it  does 
not  fail  in  the  presence  of  systematic  noise  alone  or  random  noise 
alone.  But  it  sometimes  breaks  down  when  random  and  system  noise 
are  both  large  and  are  both  present  in  the  data. 

Westwater:  On  your  statement  that  the  residuals  converged  to  the 

noise  level,  does  this  convergence  depend  on  the  method  of  inter- 
polation that  you  use  to  fit  the  function  ’*f"?  You  would  think 
that  linearly  interpolating  between  estimated  points  might  con- 
ceivably give  a different  residual  than  a quadratic  interpolation 
or  a higher  order  interpolation. 

Chahine:  Yes,  because  in  the  absence  of  noise  in  the  data,  the 

residuals  decrease  rapidly  and  approach  an  asymptotic  level  equal 
to  the  quadrature  error.  Now,  different  interpolation  methods  will 
introduce  different  quadrature  errors  and,  therefore,  the  residuals 
will  approach  different  values  reflecting  the  level  of  quadrature 
errors  in  the  solution. 

Westwater:  That  would  imply  to  me  that  you’re  really  not  con- 

verging to  the  noise  level,  because  if  you’re  converging  to  a 


^See  Ref. 
2 

See  Fig. 

3 

See  Ref. 


19  in  this  paper. 
7 in  this  paper. 

20  in  this  paper. 


115 


quantity  that  depends  upon  method  of  interpolation  that  you're 
using,  that  really  does  not  bear  a direct  relationship  to  the 
noise  level. 

Chahlne:  The  noise  level  in  the  system  of  equations  is  the  sum  of 

the  systematic  noise,  the  random  noise  as  well  as  the  quadrature 
noise. 
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STATISTICAL  PRINCIPLES  OF 


INVERSION  THEORY 


C-  D.  Rodgers 
Clarendon  Laboratory 


The  accuracy  of  solutions  to  the  inverse  problem  of 
radiative  transfer  is  a topic  that  has  received  very  little 
attention  from  the  theoretical  point  of  view  in  the  meteorol- 
ogical literature , Many  of  the  sources  of  error  are  sta- 
tistical in  nature,  and  statistical  methods  must  be  used  to 
deal  with  them.  Such  methods  give  considerable  insight  into 
both  the  nature  of  the  problem  and  the  nature  of  the  solu- 
tion. 

All  the  available  information  about  an  unknown  profile 
can  be  expressed  in  the  form  of  values  of  functions  of  that 
profile  and  error  estimates  of  these  values.  Estimation 
theory  shows  how  these  values  are  combined  to  give  an  esti- 
mate of  the  unknown  profile  and  its  error  covariance.  Many 
inversion  methods  can  be  expressed  in  this  form,  although 
the  error  estimate  is  not  usually  carried  out.  Practical 
applications  are  described,  both  for  inversion  of  individual 
profiles , and  the  global  analysis  of  satellite  data. 


1.  INTRODUCTION 

This  paper  is  based,  to  a large  extent,  on  a review  paper 
(Ref.  1)  , which  covers^  many  of  the  topics  in  much  more  detail. 

The  subject  is  not  new  by  any  means.  However,  the  implications 
of  a correct  statistical  analysis  of  meteorological  inverse  prob- 
lems do  not  seem  to  be  properly  appreciated,  so  it  seemed  reason- 
able to  try  to  state  them  clearly  at  this  Workshop.  All  physical 
problems  involving  measurements  of  continuous  variables  must  be 
analyzed  by  statistical  methods  simply  because  measurement  error 
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is  statistical  in  nature.  We  can  never  measure  an  exact  value, 
we  can  only  say  that  the  value  lies  in  such  and  such  a range,  or 
belongs  to  a population  with  a known  probability  density  function. 
Any  quantity  derived  from  the  measurement  is  also  a random  variable, 
and  in  developing  methods  for  finding  derived  quantities,  we  must 
take  account  of  this  fact.  There  are  standard  statistical  tools 
available  (e.g..  Ref.  2);  therefore,  it  is  not  necessary  to  invent 
anything  new — it  is  just  a matter  of  recognizing  when  a particular 
tool  is  applicable  to  the  problem  in  hand. 

The  kind  of  question  that  we  would  like  to  answer  is  some- 
thing like  this:  Given  a set  of  measurements  of  radiation  emitted 

(reflected,  scattered)  by  the  atmosphere,  our  current  understanding 
of  the  physics  of  the  atmosphere,  and  any  other  measurements  that 
may  be  relevant,  what  can  we  say  about  the  state  of  the  atmosphere? 
The  question  is  usually  expressed  in  teimis  of  profiles  of  unknown 
quantities,  such  as  temperature  and  composition.  Given  a measure- 
ment of  a known  function  of  a profile,  estimate  that  profile  when 
there  are  experimental  errors  in  the  measurement,  and  errors  in 
our  knowledge  of  the  function.  An  essential  part  of  the  measure- 
ment is  a proper  characterization  of  both  kinds  of  errors.  If  we 
do  not  know  the  magnitude  of  the  error  in  a measurement  or  in  the 
theory,  the  measurement  is  not  worth  making. 

A little  thought  shows  that  this  question  can  only  be  answered 
in  a statistical  sense.  Given  a probability  density  function  rep- 
resenting our  knowledge  of  some  function  of  a profile,  find  a 
probability  density  function  representing  our  knowledge  of  the 
profile  itself.  This  applies  to  almost  any  physical  measurement 
of  almost  anything,  but  it  is  a little  more  complicated  in  our  case 
because  the  result  is  a profile,  rather  than  a single  number  to 
which  we  can  assign  error  bars. 

If  we  are  going  to  discuss  general  profiles,  we  should  use 
the  algebra  of  Hilbert  space.  It  seems  to  me  that  the  use  of 
Hilbert  space  notation  is  an  unnecessary  complication  in  this 
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subject.  This  is  not  because  the  algebra  is  more  difficult  than 
the  algebra  of  matrices,  but  simply  that  many  physicists  are 
unfamiliar  with  the  mathematical  jargon,  and  in  the  end  computer 
programs  that  implement  inversion  methods  will  use  the  algebra  of 
matrices-  In  practice,  it  is  not  possible  to  specify  an  absolutely 
general  profile  because  this  would  require  an  uncountable  infinity 
of  numbers.  We  must,  therefore,  deal  with  profiles  that  can  be 
specified  by  a finite  number  of  numbers,  using  some  representation 
or  discretization  of  the  profile.  An  important  consideration  in 
the  choice  of  representation  is  that  it  must  be  indistinguishable 
from  the  true  profile  for  all  practical  purposes-  In  our  case,  the 
radiation  emitted  by  an  atmosphere  with  the  true  profile  must  be 
the  same  as  that  emitted  by  an  atmosphere  with  a profile  given  by 
the  corresponding  representation  well  within  experimental  error. 

At  this  point,  we  run  up  against  the  main  problem  of  all 
inversion  methods-  The  problem  is  ill  posed  (underconstrained). 

We  need  a relatively  large  number  of  numbers  to  construct  a 
reasonable  representation,  and  our  measurement  of  radiation  is 
usually  made  in  a relatively  small  niimber  of  spectral  intervals 
(or  scan  angles  in  the  case  of  limb  sounding) . Even  if  there  were 
no  experimental  error,  there  are  usually  not  enough  measurements 
to  determine  a profile  uniquely-  The  presence  of  experimental 
error  simply  makes  the  problem  worse. 

II.  A STATISTICAL  APPROACH 

The  only  way  of  solving  the  problem  is  by  making  use  of  some 
kind  of  extra  a priori  information.  (The  alternative  is  to  find 
a different  problem  to  solve.)  The  source  of  a priori  information 
may  be  the  physics  of  the  problem,  statistics  of  other  measure- 
ments, arbitrary  restrictions,  prejudice,  etc.  The  information, 
itself,  can  take  many  forms,  such  as  representations  with  smaller 
number  of  parameters,  smoothness,  least  squares  deviation  from 
some  a priori  profile,  climatology,  etc.,  but  whatever  the  form. 
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they  are  all  similar  in  nature.  They  contain  (or  purport  to  con- 
tain) information  about  the  unknown  profile,  just  as  the  measure- 
ments do.  An  enlightening  way  of  describing  a priori  information 
is  to  call  it  "virtual  measurements."  They  provide  values  of 
known  functions  of  the  unknown  profile  with  specified  errors  just 
as  the  real  measurements  do.  To  solve  an  inverse  problem,  the 
real  measurements  and  the  virtual  measurements  must  contain  enough 
pieces  of  information  to  determine  all  the  parameters  of  the  pro- 
file with  an  accuracy  adequate  for  the  application,  i.e.,  there 
must  be  enough  virtual  measurements  to  make  the  problem  well 
posed. 

One  way  of  considering  the  problem  in  order  to  develop  some 
physical  insight  is  to  express  it  in  terms  of  N-dimensional  geom- 
etry, with  N = 3 for  the  purpose  of  imagining  what  might  happen. 

If  the  representation  of  the  profile  requires  N parameters,  we  can 
treat  it  as  a point  in  an  N dimensional  "profile  space"  with  the 
parameters  as  coordinates.  A priori  information  can  be  regarded 
as  giving  a value  and  uncertainty  (including  covariance)  to  these 
parameters,  or  equivalently,  can  be  regarded  as  specifying  a 
-region  of  profile  space  within  which  we  believe  the  solution  to 
lie.  This  a priori  constraint  region  may  or  may  not  be  infinite 
in  extent.  In  the  example  in  Fig.  1,  the  a priori  region  is  rep- 
resented by  the  large  ellipsoid. 

The  measurement  consists  of  M numbers,  each  of  which  is  a 
known  function  of  the  profile.  They  represent  a point  in  an  M 
dimensional  "measurement  space,"  which  is  a mapping  of  profile 
space.  Typically,  M <<  N,  and  a point  in  measurement  space  maps 
on  to  a region  of  profile  space.  This  region  is  the  class  of  pro- 
files which  are  consistent  with  the  measurements.  In  Fig.  1,  such 
a region  is  represented  by  the  axis  of  the  infinite  cylinder.  The 
finite  width  of  the  cylinder  expresses  experimental  error.  The 
class  of  solutions  which  is  consistent  with  both  the  measurements 
and  the  constraints  within  the  error  bounds  is  represented  by  the 
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Fig.  1.  Illustrating  the  relationship  in  profile  space 
between  the  a priori  information  (x  ,S  ) , the  measurement 
(K^S^^K)  and  the  solution  (x,S). 


small  ellipsoid  in  Fig.  1.  We  can  see  that  the  main  effect  of  the 
constraints  is  to  determine  those  components  of  the  profile  which 
are  not  determined  by  the  measurement — in  this  case,  position 
along  the  cylinder.  The  aim  of  statistical  inverse  theory  is  to 
determine  the  position  and  extent  of  the  solution  region  of  pro- 
file space,  i.e.,  to  find  the  solution  and  its  uncertainty,  or  to 
characterize  the  class  of  profiles  consistent  with  both  the  real 
and  the  virtual  measurements. 

III.  LINEARITY 

We  can  classify  inverse  problems  according  to  their  degree 
of  nonlinearity  in  terms  of  the  methods  used  to  solve  them  and 
to  carry  out  the  error  analysis.  One  such  classification  is 

1.  Linear:  Linear  problems  can  be  solved  explicitly,  pro- 

vided that  there  are  enough  measurements,  both  real  and  virtual. 
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so  that  the  problem  is  well  posed.  Few  real  problems  are  linear. 

2.  Nearly  linear:  This  class  of  problems  can  be  linearized 

and  solved  with  a few  iterations. 

3.  Moderately  nonlinear:  These  are  sufficiently  nonlinear 

for  an  ad  hoc  method  to  be  required  to  find  a solution  efficiently, 
but  are  linear  enough  within  the  error  bounds  to  ensure  that  the  error 
analysis  can  be  carried  out  with  linear  theory. 

4.  Grossly  nonlinear:  These  are  nonlinear,  even  within  the 

error  bounds  of  the  solution. 

In  many  cases,  considerable  improvement  can  be  gained  by  not 
solving  for  the  profile  directly,  but  by  solving  for  some  non- 
linear function  of  it.  For  example,  if  we  solve  for  a Planck 
function  profile  rather  than  a temperatiire  profile,  in  the  case 
of  4.3  ym  band  or  15  \im  band  temperature  sounding,  the  problem 
becomes  nearly  linear  rather  than  moderately  nonlinear. 

IV.  LINEAR  ERROR  ANALYSIS 

The  purpose  of  error  analysis  is  to  characterize  the  class  of 
profiles  which  are  consistent  with  the  measurement.  Linear  error 
analysis  applies  to  all  except  grossly  nonlinear  problems.  We 
will  assume  that  we  have  somehow  found  a profile  which  is  a 
solution  within  the  error  bounds.  We  will  use  it  as  a lineariza- 
tion point. 

The  measurement  y is  a known  function  F(x)  of  the  unknown 
profile  represented  by  the  vector  of  parameters  x,  with  error 
covariance  S 

y 

y = F(x)  + e Covariance  (e)  = S 

y 

We  assume  that  within  the  error  bounds  of  the  solution,  this 
equation  can  be  linearized  about  our  initial  solution  to 


Denote  the  Frechet  derivative  3'F/9x  by  the  matrix  K,  and  we  can 

'\j 

write : 

y - F(x^)  = ^ (x  - x^)  + e (1) 

The  a priori  information  can  be  linearized  similarly.  For  sim- 
plicity, we  will  only  consider  the  case  where  x is  an  a priori 
value  for  the  profile  x,  with  covariance  S^. 

X = X + I Covariance  (5)  = S^.  (2) 

where  | is  a random  vector. 

There  are  standard  statistical  methods  for  combining  indepen- 
dent measurements  of  the  same  quantity,  such  as  Eqs.  (1)  and  (2) . 
They  are  based  on  expected  value,  minimum  variance  or  maximum 
likelihood  estimators.  The  one  everybody  should  be  familiar  with 
is  the  combination  of  two  scalar  measurements.  If  and  are 
two  independent  measurements  of  an  unknown  x,  with  standard  devia- 
tions a-j^  and  02t  respectively,  then  the  best  estimate  of  x is  x 
with  standard  deviation  a where 

/ -'2  -2 
+ O2  ) 

X = a^(x^/a^  + x^/al) 

i-e.,  the  measurements  are  weighted  inversely  with  their  variances. 
In  our  multivariate  case,  this  generalizes  to 

^ —1  T —1 

S = (S  + K S K)  (3) 

'v  X ^ y ^ 

X - X = s{S~^(jE  - X ) +.K'^S~^(y  - F(X  ))}  (4) 

O x . O Oj  y O 

which  is  in  effect  a weighted  mean  of  two  independent  measurements 
of  X - x^,  weighted  with  inverse  covariance -matrices.  One  estimate 
is  X - x^,and  the  other  is  (y  - F(x^))  where  K*  is  any  matrix 
such  that  is  a unit  matrix.  The  second  estimate  is  not  unique 

because  K*  is  not  unique.  For  the  purposes  of  this  section,  S is 
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the  more  important  quantity,  being  the  covariance  matrix  of  the 
solution  X.  It  contains  information  about  the  uncertainty  in  x, 
it  bounds  the  class  of  acceptable  solutions. 

Equations  (3)  and  (4)  comprise  the  complete  solution  in  the 
case  of  linear  problems  and  define  the  iteration  to  be  carried 
out  in  the  case  of  nearly  linear  problems.  At  each  stage  of  the 
iteration  x^  is  replaced  by  x from  the  previous  stage.  Apparently 
different  solutions  (e.g.,  statistical,  Twomey-Tichenov,  etc.)  only 
differ  in  the  form  of  x and  S^,  which  express  the  nature  of  the 
a priori  constraints.  In  the  case  of  moderately  nonlinear  prob- 
lems, it  may  be  necessary  for  efficiency  to  find  by' using  some 
ad  hoc  procedure,  such  as  Chahine's  method,  but  a final  stage  of- 
linearization  and  error  analysis  should  be  carried  out  according 
to  Eqs.  (3)  and  (4). 

To  understand  the  meaning  of  the  covariance  matrix  S,  we  can 
examine  the  diagonal  elements,  which  comprise  the  "residual  vari- 
ance" or  accuracy  of  the  individual  elements  of  the  solution  pro- 
file. This  examination  is  illustrated  in  Fig.  2 for  an  idealized  case. 
Residual  variance  gives  a useful  rough  estimate  of  the  accuracy  of 
the  solution  profile,  but  it  is  not  a complete  description.  Off^ 
diagonal  elements  of  S are  generally  nonzero,  so  that  there  are 
correlations  between  the  errors  at  different  levels. 

The  form  of  the  errors  may  be  more  easily  understood  if  we 
diagonalize  the  error  covariance  matrix  by  finding  its  eigenvalues 
and  vectors.  The  vectors  form  a set  of  "error  patterns"  whose 
coefficients  are  statistically  independent  from  each  other,  and 
the  values  are  the  variances  of  these  coefficients.  An  illus- 
tration of  such  a set  of  error  patterns  is  given  in  Fig.  3 and 
Table  1.  The  solution  is  uncertain  to  the  extent  of  adding  each 
pattern  to  it  with  a coefficient  which  is  random  and  has  a vari- 
ance given  by  X^.  Note  that  all  the  patterns  have  only  fine  scale 
structure  which  is  on  the  same  kind  of  space  scale  in  all  cases. 
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Fig.  2.  A comparison  of  total  variance  (solid  line)  and 
residual  variance  (dashed)  for  a synthetic  case  using  eight 
weighting  functions  which  peak  on  the  range  from  1.7  to  7.5  scale 
heights f and  a realistic  statistical  covariance  matrix. 
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Fig.  3.  Error  patterns  for  the  same  case  as  Fig.  2. 


TABLE  1 


Eigenvalues  of  the  Solution  Covariance 
for  an  Idealized  Case 


n 

1 

2 

3 

4 

5 

6 

7 

8 

X 

469 

274 

84 

47 

40 

36 

21 

19 

n 

V.  NONLINEAR  ERROR  ANALYSIS 

There  is  no  general  method  of  error  analysis  that  can  apply 
to  all  nonlinear  problems,  just  as  there  is  no  general  solution. 
However,  we  can  outline  a strategy  based  on  the  linear  error 
analysis  which  will  give  a good  indication  of  the  solution  error 
and  the  degree  of  nonlinearity. 

We  must  first  define  what  is  meant  by  a solution  in  the  non- 
linear case.  This  will  be  something  like  the  maximum  of  a likeli- 
hood, or  the  minimum  of  a quadratic  risk  function,  for  example 
X might  be  the  minimum  of 

(X  - x)^s"^(x  - X)  + (y  - F(x))\'/(y  - F(x))  (5) 

X y 

where  we  have  used  linear  a priori  infojrmation  {x,  S^}  and  a non- 
linear observing  system — F(x).  More  general  expressions  can  be 
used.  The  minimization  is  possible  in  principle,  but  I do  not 
wish  to  discuss  how  it  should  be  done.  A linearization  is  pos- 
sible about  X,  and  we  can  define  a matrix  S as  in  Eq.  (3) . The 
linearization  may  not  be  valid  in  the  region  of  profile  space 
defined  by  S,  and  the  only  way  to  find  out  is  to  explore  that 
region  empirically,  or  to  examine  higher  order  terms  in  the  expan- 
sion of  y in  terms  of  x - x.  It  is  usually  much  easier  to  explore 
empirically  than  to  expand  to  quadratic  and  cubic  terms,  A sen- 
sible strategy  is  to  explore  in  the  direction  of  the  eigenvectors 
of  ^ to  a distance  from  x given  by  the  square  root  of  the 
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corresponding  eigenvalue-  In  terms  of  the  illustration  in  Fig.  1, 
we  are  exploring  in  the  directions  of  the  axes  of  the  error  ellip- 
soid, to  a distance  of  one  a.  At  these  new  points  in  profile 
space,  we  evaluate  either  y or  the  risk  function,  both  according 
to  the  linear  approximation  and  according  to  the  proper  expression 
given  in  Eg.  (5) - A comparison  of  the  two  values  will  give  a 
measure  of  the  nonlinearity  within  the  error  bounds  and  an  esti- 
mate of  the  true  error  bounds. 

Using  this  technique,  we  can  distinguish  between  moderately 
nonlinear  and  grossly  nonlinear  problems.  This  should  be  done  in 
the  development  stages  of  an  inversion  method,  even  if  it  is  not 
done  when  the  method  is  in  operational  use. 

Similarly,  the  only  way  of  distinguishing  between  nearly 
linear  and  moderately  nonlinear  problems  is  to  try  a linearization 
algorithm,  and  to  make  some  kind  of  quality  judgment  on  the  number 
of  iterations  required  for  convergence. 

VI.  SEQUflNTIAL  ESTIMATION 

The  basic  concept  of  statistical  inversion  methods  is  one  of 
updating  information.  We  start  with  some  a priori  knowledge  of  a 
profile,  together  with  a measure  of  uncertainty,  as  may  be 
expressed  in  a covariance  matrix.  We  then  measure  something 
related  to  the  profile,  enabling  us  to  make  a new,  updated 
estimate  of  the  profile,  and  an  updated  covariance  metrix- 

Once  this  simple  principle  has  been  grasped,  a host  of  pos- 
sibilities become  apparent.  I will  briefly  describe  three  of 
them. 


1.  It  is  not  necessary  to  update  the  estimate  with  a whole 
vector  of  observations  at  once.  The  measurements  can  be  included 
one  at  a time  as  scalars.  The  updating  can  then  be  carried  out 
without  inverting  a matrix,  because  the  estimation  Eqs.  (3)  and 
(4)  can  be  manipulated  to  give 
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T T -1 

- S K (KS  K + S ) KS 
Xru  ^ X y r\j  X 


(6) 


X - X = S k’^(KS  k'^  + S )~^(y  - F(x  ) + K(x  - x ))  (7) 

xn^'v^xTo  Y o 

If  y is  a scalar,  then  is  a scalar  and  K is  a vector:  thus, 
the  inverse  becomes  a scalar  reciprocal.  After  updating  with  one 

/N. 

component  of  y,  S and  x take  the  place  of  and  x^  for  the  next 
stage.  The  matrix  inverse  is  eliminated,  but,  in  fact,  the  total 
number  of  operations  is  similar  so  that  there  is  no  direct  compu- 
tational advantage.  However,  if  the  order  in  which  the  data  is 
used  is  chosen  correctly,  problems  of  nonlinearity  can  be  reduced 
so  that  the  number  of  iterations  is  reduced.  This  is  simply  a 
matter  of  using  the  more  linear  measurements  first:  thus,  when 

the  time  comes  to  incorporate  the  nonlinear  measurements,  the  cur- 
rent estimate  is  nearer  to  the  solution. 


2.  We  can  make  use  of  the  horizontal  homogeneity  of  the 
atmosphere  in  a very  simple  way.  If  we  have  inverted  a profile  at 
position  n,  obtaining  x^  and  S^,  then  we  can  use  this  in  conjunc- 
tion with  any  other  available  estimate  to  construct  the  a priori 
estimate  at  position  n + 1.  We  could,  for  example,  use  a very 
simple  model  of  the  horizontal  behavior  of  the  atmosphere: 


x^ 


n + 1 


= X + (x 


n + 1 


- 


n 


S = S + As 

n + 1 n 

, o o . . 

where  tx  , S j is  the  a prion  estimate  at  n + 1,  x is  a 

n + 1 n + 1 

climatology,  and  AS  is  a measure  of  the  horizontal  correlations. 

This  simple  model  is  something  like  a random  walk  process-  The 
effect,  in  the  case  of  temperature  sounding  in  cloudy  cases,  for 
example,  is  to  propagate  information  into  a cloudy  region  from  a sur- 
rounding clear  region,  taking  proper  account  of  the  growing  uncer- 
tainty as  we  go  further  from  the  clear  region. 


3.  We  can,  in  principle,  do  a global  analysis  of  fields  of 
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temperature  and  composition  using  sequential  estimation  and  updating 
the  whole  analysis  at  every  measurement  time-  In  practice,  the 
covariance  matrices  required  would  be  prohibitively  large.  However, 
we  can  use  the  principle  to  carry  out  two-dimensional  analyses  and 
then  combine  the  analyses  to  construct  a three-dimensional  field. 

The  method  can  be  used  to  reconstruct  missing  measurements  or  to 
interpolate  objectively  on  to  synoptic  times.  I will  describe  in 
some  detail  how  this  method  has  been  used  to  analyze  Nimbus  5 SCR 
radiances. 

Nimbus  5 is  in  a near  polar  orbit,  and  the  SCR  measures  radi- 
ances from  every  latitude  between  80°S  and  80^N  twice  each  orbit. 

We  assume  that  the  radiance  at  any  latitude  can  be  represented  by 
a Fourier  series  in  longitude  whose  coefficients  vary  with  time: 

r^6 

R(A,t)  = R(t)  + ) a (t)cos(nA)  + b (t)sin(nA)  (8) 

^1  n n 

where  R(A,t)  is  the  radiance  (or  other  quantity  to  be  analyzed) 
at  longitude  A and  time  t.  R(t)  is  the  zonal  mean;a^(t)  and 
are  the  Fourier  coefficients.  Six  wave  numbers  have  been  used  and 
13  coefficients  are  given  because  the  satellite  has  13.4  orbits 
per  day:  thus,  on  a time  scale  of  a day  resolution  of  a finer 
structure  cannot  be  expected. 

We  may  write  Eq.  (8)  in  a vector  product  form 

R(X,t)  = k'^(X)  X x(t) 

T 

where  x is  a column  vector  of  Fourier  coefficients,  and  K is  a 

row  vector  of  sines  and  cosines.  We  operate  on  each  latitude 

independently.  At  time  t^,  we  make  a measurement  of  R(A(t^),t^) 

at  some  longitude  A (t^) . 'Let  us  call  that  measurement  y,  with 

error  variance  a^.  The  available  a priori  information  at  t is 

n 

the  estimate  of  the  Fourier  coefficients  made  at  time  t , the 

n - 1 

last  time  a measurement  was  made  at  the  same  latitude.  Let  us 
call  this  estimate  x^  ^ . We  must  make  some  assumption  about  the 
statistics  of  the  time  evolution  of  the  Fourier  coefficients.  The 
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simplest  assumption  is  a random  walk,  in  which  case  the  a'  priori 

estimate  for  time  t is 

n 


(9) 


t 

n 


(10) 


where  As  is  a measure  of  the  increase  of  our  uncertainty  per  unit 

time.  We  now  combine  the  measurement  y of  K(A  ) x K(t  ) according 

^n  n n 

to  the  linear  version  of  Eqs.  (6)  and  (7) . 


o 

o 

, T_o 

S 

= S 

- S K 

(K  S K 

+ 

n 

n 

n n 

n n n 

o 

_o 

X 

= X 

+ S K 

(K  S K 

+ 

n 

n 

n n 

n n n 

where  the  notation  has  been  simplified  in  an  obvious  way.  Note 

that  the  inverse  is  a scalar  reciprocal,  so  that  the  arithmetic  to 

be  performed  at  each  stage  is  not  excessive.  If  a measurement  is 

missing  at  time  t , then  and  are  the  best  estimate, 
n n n 

This  sequential  estimator  permits  an  estimate  of  the  time 
development  of  the  Fourier  coefficients  x,  and  ensures  knowledge 
of  the  accuracy  of  the  estimation.  It  can  be  used  to  interpolate 
missing  data  and  to  smooth  existing  data  in  an  objective  manner. 
(Incidentally,  the  a priori  information  and  its  covariance  allows 
us  to  detect  bad  data  by  means  of  a 3a  test. ) We  can  also  use  it 
to  reconstruct  global  fields  at  synoptic  times,  simply  by  evalu- 
ating the  Fourier  series  at  the  appropriate  times. 


As  it  stands,  the  method  is  one  sided.  We  can  only  use  the 

measurements  which  precede  t to  estimate  at  t . If  the  method  is 

n n 

being  used  operationally,  this  is  the  best  we  can  do,  because  the 

future  is  not  available  to  us.  In  this  case,  more  attention  should 

be  paid  to  the  "forecast"  from  x to  x^  (Eqs.  9 and  10) . 

n - 1 n 

However,  if  we  are  analyzing  a long  run  of  data  for  research  pur- 
poses, we  would  like  to  use  both  sides  of  the  time  axis.  This  can 
be  done  quite  easily  by  analyzing  the  data  backwards  in  time,  and 
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combining  the  forwards  and  backwards  estimates  in  the  proper  sta 
tistical  manner — by  using  the  reciprocal  of  the  covariance  as  a 
weight . 

Figures  4 and  5 illustrate  the  effect  of  this  kind  of  esti- 
mator in  reconstructing  missing  data  and  smoothing  existing  data 
when  operating  on  a time  series  of  measurements  about  10  days 
long  at  a latitude  of  40^N. 


ORBIT  NUMBER 


Fig.  4.  Reconstruction  and  smoothing  of  Nimbus  5 SCR  chan- 
nel B12  at  40^N — a typical  case.  The  data  points  are  marked  P; 
the  reconstructed  data  is  a continuous  line. 
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Fig.  5.  Illustrating  reconstruction  of  about  l~h  dags  of 
missing  data. 


VII.  DISCUSSION 

All  the  previous  discussion  is  a special  case  of  the  Kalman 
filter  (Ref.  3) , which  is  a particularly  powerful  and  general 
approach  to  the  analysis  of  multivariate  time  series.  Some  gen- 
eralizations  of  the  problem,  such  as  the  case  where  it  is  not  the 
profile,  but  some  function  of  the  profile  (e.g.,  thickness)  which 
is  of  interest  have  not  been  discussed.  Problems  of  this  type  are 
also  amenable  to  Kalman  filtering. 
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SYMBOLS 


a , b 
n n 

F(x) 


K 

'X/ 

K* 

o» 


K 


n 


R 

R 

S 

y 

I 

s 

n 

SO  ^ 
n + 1 

t 
t 
T 
X 


n 


X2 

X 

X 


n + 1 


X 

y 


n 

AS 


n 


^1  ' ^2 


Fourier  coefficients 

known  function  of  the  unknown  profile 
matrix  denoting  the  Frechet  derivative  dF/dX 
any  matrix  satisfying  the  relation  KK*  = unit  matrix 
value  of  K at  time  t 

n 

radiance 
zonal  mean 
error  covariance 
covariance  matrix  of  x 
values  of  S at  position  n 

a priori  estimate  of  at  position  n + 1 
time 

time  at  nth  position 

transpose  of  a matrix 

parameters  of  F (x) 

two  independent  measurements  of  x 

best  estimate  of  x 

values  of  x for  an  idealized  case 

initial  value  of  x 

a priori  estimate  of  x^  at  position  n + 1 

a priori  value  for  the  profile  x 

measured  quantity 

measurement  of  K(X  ) x K(t  ) 

n n 

measurement  of  the  horizontal  correlations 
measurement  error 
random  vector 
longitude 

nth  eigenvalue  of  the  solution  covariance 
standard  deviation 
standard  deviation  of  x 
error  variance 

standard  deviations  for  x^  and  x^r  respectively 
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DISCUSSION 


Susskind:  I just  wanted  to  ask  something  quickly  about  what  you 

have  just  done.  These  radiances — are  these  the  observed  radiances 
or  the  radiances  taking  out  the  effects  of  clouds,  or  what?  I 
mean,  if  you  have  clouds  coming  into  the  picture^ it  is  certainly 
going  to  foul  up. 

Rodgers:  These  are  actually  stratospheric  radiances. 

Susskind:  Okay,  so  you're  not  worrying  about  that. 

Rodgers:  This  isn't  the  Nimbus  5 selective  chopper.  It's  about 

45  kilometers. 

Green:  On  your  first  slide,  you  said  "given  some  knowledge  of  a 

function."  Should  you  have  replaced  the  word  ’-function"  by 
"functional"? 

Rodgers:  Sorry,  yes,  it  is  functional. 

C?reen;  Thus,  the  problem  is  how  to  get  a function  from  a func- 
tional. 

Rodgers:  I was  just  not  being  quite  rigorous. 

Chahine:  Clive,  you  made  a statement  that  we  should  pay  just  as 

much  attention  to  our  actual  measurement  as  we  do  to  virtual 
measurements.  I know  how  to  improve  my  actual  measurements-  I 
have  the  physics.  How  can  I improve  my  virtual  measurements  and 
be  sure  of  that? 

Rodgers:  By  the  same  sort  of  techniques  as  your  actual  measure- 

ments. If  you  have  no  virtual  measurements,  you  just  can't  solve 
the  problem.  You  just  have  to  go  into  some  other  problem.  The 
only  way  of  producing  a profile  is  by  having  enough  virtual 
measurements  from  somewhere.  It  may  be  physics.  I can  produce  a 
virtual  measurement  off  the  top  of  my  head  immediately.  I can  say 
the  temperature  in  the  atmosphere  anywhere  is  going  to  lie  between 
zero  and  500  . I know  it's  not  going  to  help  you  very  much;  it 
reduces  the  variance  a bit.  It  just  makes  it  noninfinite  at  least 
But  it  still  means  the  errors  on  all  the  points  are  going  to  be 
250°. 

Chahine:  For  the  nonlinear  method,  I assume  the  temperature  to  be 

positive  and  real.  But  in  your  case,  you  are  using  a priori  sta- 
tistics. 

Rodgers:  This  isn't  only  statistics.  This  applies  to  any  kind  of 

virtual  measurement. 
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Chahine:  Only  real  physical  data  should  be  used  to  judge  virtual 

data.  That's  because  of  the  high  variability  of  the  atmosphere 
and  clouds.  How  can  you  be  sure  that  the  virtual  measurements 
that  you  have  now  are  good  tomorrow  or  are  good  in  the  presence  of 
a front?  ' 

Rodgers:  I don't  know.  But  in  the  method  you  described  you  have 

virtual  measurements  which  you  haven't  stated  explicitly  and  this 
is  your  interpolation  rule.  That's  virtual  measurement,  which  is 
a very  unrealistic  one  in  fact. 

Chahine:  Not  for  temperature.  Professor  Kaplan  might  like  to 

comment  on  the  assumption  of  constant  lapse  rate  between  two  levels 
in  the  atmosphere. 

Kaplan:  That  is  assuming  you  know  where  the  lapse  rate  changes. 

And  anybody  who  has  worked  on  recording  radiosonde  measurement  of 
temperature  knows  that  the  temperature  can  increase  in  the  atmo- 
sphere in  general  linear  with  height  up  to  the  point  where  the 
lapse  rate  changes.  Of  course,  the  lapse  rate  changes  at  arbitrary 
levels  and  you  have  to  be  able  to  pick  the  point  at  which  this 
comes  in.  I mean,  there  are  assumptions.  You  assume  if  you  pick 
the  height  at  which  you  attribute  a frequency,  measurements  at  a 
frequency,  this  is  a point  at  which  the  lapse  rate  changes  or  you 
fit  to  a polynomial.  But  there  are  these  constraints. 

Rodgers:  But  you  have  got  to  recognize  that  it  is  a virtual 

measurement  of  some  kind,  with  some  variance. 

Kaplan:  Yes,  and  to  the  extent  to  which  your  vertical  resolution 
gets  worse  as  you  get  larger,  this  is  more  and  more  an  error.  As 
you  narrow  your  resolution  this  becomes  much  less  important. 

Chahine:  Have  you  done  an  analysis  to  determine  how  often  you 

hit  and  how  often  you  miss? 

Rodgers:  What  do  you  mean? 

Chahine:  With  your  probability  approach,  how  often  do  you  end  up 

with  the  correct  answer  in  applying  your  technique  to  real  data? 

Rodgers:  Well,  this  doesn't  really  apply.  This  is  not  really 

within  the  scope  of  what  I was  trying  to  describe-  I wasn't 
going  into  techniques  of  how  you  find  x,  I was  going  into  tech- 
niques of  how  you  find  S- 

Chahine:  I wanted  to  see,  in  applying  the  statistical  approach  to 

real  data,  how  often  you  hit? 

Rodgers:  You  mean  how  good  can  your  statistics  be? 
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Chahine : 


Yes. 


Rodgers:  That's  another  question  which  I haven't  really  tried  to 

touch  on  here.  I know  it  is  very  difficult  to  get  a priori  infor- 
mation. Statistics  is  only  one  way.  But  whatever  your  a priori 
information  is,  you  have  to  work  on  it  as  hard  as  you  can. 

Wark:  Have  you  ever  given  any  consideration  to  other  aspects  of 

the  radiance  field?  Namely,  that  we  usually  try  to  solve  for  pro- 
files as  though  they  existed  as  the  only  profile  in  nature,  and 
then  we  go  on  to  another  set  of  radiances  and  try  to  solve  them. 

In  addition  to  this,  we  have  gradients  which  exist  in  the  radiance 
field  and  these  gradients  are  very  strongly  tied  to  the  dynamics 
of  the  atmosphere.  That  is,  the  same  set  of  radiances  in  the  same 
location  at  the  same  time  of  year  can  be  associated  with  quite 
different  profiles,  mainly  because  of  the  gradients  which  occur, 
which  are  the  physical  dynamical  processes  in  the  atmosphere. 

Rodgers:  This  is  why  I have  been  recently  getting  interested  in 

doing  global  analysis  of  radiances  to  try  and  make  an  estimate  of 
the  global  distribution  rather  than  individual  profiles. 

Wark:  But  have  you  tried  to  associate  this  with  the  gradients  in 

the  temperature  field? 

Rodgers:  Not  yet.  My  feeling  about  analyzing  meteorological  data 

is  that  it  should  be  the  meteorological  analyst's  job,  not  ours- 
We  shouldn't  go  through  this  'interface  of  profiles  or  anything 
like  it.  We  should  get  him  good  calibrated  radiances.  He's 
already  doing  an  inverse  problem.  He's  solving  for  the  field  of 
whatever  it  is,  given  certain  measurements  of  something  different. 
Radiances  are  just  another  thing. 

Wark:  That's  right.  We  should  be  giving  radiances  to  the 

meteorologists  and  let  them  inject  them  into  their  analyses. 

Chahine:  What  you  are  asking  for  is  a simultaneous  solution  of 

the  radiative  transfer  equation  and  the  equations  of  motions. 

This  is  a great  aim.  It  isn't  easy. 

Wark:  But  I wanted  to  emphasize  the  point  that  the  solution  for 

profiles  is  not  necessarily  our  aim  here  in  a meteorological 
sense. 

Rodgers:  Sure.  There  are  lots  of  things  you  can  get  out  of  this 

stuff  other  than  profiles. 

Malchow:  A bit  of  a detailed  question  about  your  S.  In  non- 

linear iterative  processes,  how  is  it  supposed  to  be  handled?  It 
seems  to  be  unstable  if  it  is  iterated  within  the  iterative  process . 
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Rodgers:  It  certainly  isn't  unstable.  No. 

Malchow:  I was  wondering  if  you  had  any  experience  with  that 

problem? 

Rodgers:  Well,  § is  trying  to  find  the  small  ellipsoid  in 

Figure  1 here.  Intuitively,  it  is  not  going  to  be  unstable  pro- 
viding you  have  got  the  right  kind  of  a priori  information.  If  S 
is  the  wrong  shape  compared  with  this  cylinder,  then  perhaps  S 
is  going  to  stretch  way  up  the  cylinder.  It  is  a matter  of  getting 
the  right  information  in  and  if  you've  got  enough  information,  S 
is  easy.  If  S is  not  easy,  you  haven't  got  enough  information. 

Delrmendjlan:  This  is  not  a question  but  just  a comment.  Since 

this  is  an  interactive  Workshop,  may  I interject  some  non- 
scientific  thoughts  about  Dr.  Rodgers'  introduction  of  the  word 
"aesthetics."  I like  it  because  "aesthetics"  derives  from  a Greek 
verb  meaning  "to  perceive  with  the  senses."  It  is  a scientist's 
prerogative  to  introduce — and  to  be  governed  a little  by — 
aesthetics  in  his  work.  This  implies  things  like  restraint,  non- 
exaggeration, nonreliance  on  innumerable  assumptions,  criteria  or 
data  banks,  and  so  on.  I would  like  to  make  an  analogy,  if  I may, 
between  sailing,  about  which  I know  some  things,  and  the  use  of 
mathematical  inversion  techniques,  about  which  I know  very  little. 
Some  people  want  us  to  use  more  and  more  instruments  and  electronic 
gadgetry  which  are  supposed  to  help  us  sail  better,  on  the  assump- 
tion that  we  have  no  senses — seeing,  hearing,  sensory  feeling — 
or  judgment  or  "sea-sense,"  A good  sailor  does  use  all  these 
things  to  advantage  for  a successful  voyage.  So,  in  analogy  to 
this,  I feel  that  sometimes  we  tend  to  resort  to  inversion  tech- 
niques too  blindly,  without  using  our  judgment  or  "feel"  about 
handling  a given  problem,  which  may  lead  to  "anti-aesthetic" 
excesses. 
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INVERSE  SOLUTION  OF  THE  PSEUDOSCALAR  TRANSFER 


EQUATION  THROUGH  NONLINEAR  MATRIX  INVERSION 


Jean  I.  F.  King 

Air  Force  Geophysics  Laboratory 


The  upwelling  radiance  from  a plane-parallel  planetary 
atmosphere  viewed  either  in  a limb  or  frequency  scan  depends 
on  the  internal  scattering  and  thermal  state  of  the  atmo- 
sphere. Each  upwelling  profile  is  the  solution  of  a uniquely 
specified,  but  generally  unknown,  pseudoscalar  transfer 
equation. 

Nonlinear  matrix  inversion  operators  have  been  devel- 
oped which,  applied  to  observed  radiances,  infer  maximal 
information  regarding  atmospheric  scattering  parameters  and 
vertical  distribution  of  radiant  sources  and  sinks.  The 
algorithm  has  the  attractive  feature  of  noise  discrimination, 
attributing  instrumental  errors  to  extra-atmospheric  sources. 


1. 


THE  MILNE  PROBLEM 


^DO  MODE^ 


IN  CODON  LANGUAGE 


Assume  the  upwelling  intensity  l(0/y)  to  be  exactly  repre- 
sentable in  y = cos  9 space  by  a linear  expression  and  n - 1 
hyperbolic  components* 


1(0, y) 


const 


y + Q + I 


n - 

a = 


1 

1 


L 

a 


1 + J 


(1) 


Substituting  the  more  convenient  transform  variable 
K = sec  0 = 1/y,  we  have 

L 


1(0,  1/k) 


= const 


+ 2 + yn  - 1 ^ 

<2  ic  = 1 K + K 

a-' 


(2) 


The  positive  definite  character  of  1(0,  1/k)  requires  the 
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hyperbolic  poles,  k = - ” l,2,.,.,n  --  1)  , to  be  negative. 

By  "clearing  of  fractions"  the  upwelling  intensity  can  be 
alternatively  expressed  as  the  quotient  polynomial  with  n roots 
K = - ^'^^i  “ l,2,...,n).  Thus, 


n 


1(0. lA) 

K 


nr 


= const 


i = + 1) 


2n"  - 1 


K^n 


a - 1 


+ 1 


= const  n^ 
1 


n - 1 H(1/k) 

a = 1 ^ K 


(3) 


where  we  have  defined  the  H-function  as 


H(1/k)  = 


n;  ^ i(k  + 1/pJ 
K ^ (K  + K ) 

a = 1 a 


(4) 


Again,  the  positive  definiteness  of  the  upwelling  intensity 
requires  the  polynomial  roots,  k =-l/iJ^(i  = l,2,...,n).,  to  be 
negative . 

The  quotient  polynomial  H-function  can  be  characterized  by 
its  sequence  of  roots  and  poles  along  the  real  axis  in  the  trans- 
form plane.  We  shall  call  such  a linear  array  a codon.  The 
importance  of  this  representation  lies  in  the  fact,  as  we  shall 
see,  that  all  the  radiation  physics  is  implicit  in  the  codon  root- 
pole  morphology. 

Using  -partial  fraction  analysis,  we  are  able  to  express  the 
coefficients  of  the  upwelling  intensity  expansion  as  residues  (or 
pole- strengths)  of  the  H-function  codon  at  the  poles  k = - 
Thus,  we  find  from  Eqs.  (3)  and  (4) 

= const  

K H K 

o 


= const 


k = 1 


K + K 


(5) 


a 
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where  the  residues 


L 

a 


g-Kiaa^-K'"  V 

o a 


H(1/k) 


n?  . (l/y . - K ) 

1 = 1 '•  1 g-' 

H n"  ■ J-(K-  - K ) 

o a 3 = 1 p ot 

7^  ct 


and 


(6) 


H 

o 


= lim  *kH(1/k) 
K ^ 0 


n"?  T y. 

1 = 1 1 a 


a 


The  Q-constant,  the  residue  of  the  double  pole  at  the  origin,  is 
evaluated  after  some  algebra  as 


rn  vn 

® ^i  = 1 ^i 


1 

1 K 

a 


(7) 


II.  CODON  TRANSFER  THEORY 


The  upwelling  profile  can  be  considered  the  externally 
sensed  solution  of  an  internal  transfer  problem.  We  proceed  now 
to  construct,  i.e.,  to  infer,  the  unique  transfer  equation, 
source  function,  constitutive  relation,  and  characteristic  func- 
tion which  are  implied  by  the  observed  intensity. 

We  need  first  an  equation  of  transfer.  Now  the  transfer 
equation  can  be  viewed  as  a conservation  condition  imposed  on  the 
intensity  under  steady-state.  We  shall  show  that  the  transfer 
equation  has  a deeper  origin  as  a codon  identity  in  transform 
space . 

From  Eq-  (2)  we  see  that  the  a component  of  the  upwelling 
intensity  obeys  the  split  codon  Identity 


K 


K + K 


K + K 


= 1 


(8) 


Let  us  consider  this  identity  as  a codon  operator,  viz. 
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K 


(9) 


K 


K + K K + K 


J„(T)  . J„(T) 


and  seek  the  internal  (scalar)  T-space  eigenfunction  which  con- 
verts this  codon  equation  into  the  equation  of  transfer. 

By  defining  the  internal  (vector)  intensity  by 


a 

we  see  by  inspection  that  the  required  eigenfunction  form  is 

“K  T 

J (t)  = L e “ 
a a 


(10) 


(11) 


for  then  we  have  the  equation  of  transfer 


dI^(T,  1/ic) 
Kdx 


= I (T, 
a 


1/k)  - J (T) 
a 


(12) 


We  note  in  passing  from  Eqs.  (2)  and  (11)  that  the  eigenfunction 
J(t)  is  the  inveTse  Laplace  transform  of  the  upwelling  intensity 


J(T)  = L 


1(0.  1/K) 

K 


= const 


-+  Q + I 


n 

a 


, -K  T' 

-L  e 
1 a 


(13) 


Equation  (12) , as  it  stands,  is  a relation  between  the  two 
dependent  variables  I(t,  1/k)  and  J(x) . To  eliminate  one  of 
these,  a second  or  constitutive  relation  is  needed.  This 
expresses  the  source  function,  the  radiation  emitted  at  level  x 
in  the  direction  0 = cos  p,  as  the  sum  of  radiation  incident  in 
all  directions  which  is  scattered  into  0,  viz. 

T 1 

J(x,p)  = 2 ^(M/P*)I(x,p')dp*  (14) 


For  local  thermodynamic  equilibrium,  the  source  emission  is 
uncorrelated  with  the  directions  of  the  incident  beams 
0 * = cos  ’ , in  which  event 
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J(T,y)  ->■  J(t) 


(15) 


so  that  the  eigenfunction  becomes  the  source  function 

J(T)  = ^ i|;(y')I(T,y*)dy'  (16) 

We  can  write,  therefore,  as  the  integro-differential  equation  of 
transfer 


dI(T,l/K) 

Kdx 


- I{ 


T,l/K)  = I /J 


ij;(y)I(T,y)dy 


(17) 


The  finite  exponential  sum  character  of  Eq.  (13)  demands  that 
the  source  function  J(x)  be  compounded  of  radiation  restricted  to 
a finite  number  of  fixed  directions.  This  is  mathematically 
accommodated  by  expressing  the  characteristic  function  as  a finite 
delta-function  sum,  i.e., 

^(Vi)  = - y^)  (18) 

Equation  (18)  implies  that  the  characteristic  function  acts  as  a 
filter  and  permits  only  the  beams  in  the  directions  y=y^=±y^ 
to  participate  in  the  source  function.  The  ID  transfer  equation 
then  becomes 


dl (x,1/k) 
Kdx 


I (x,1/k) 


J(T)  = J ip^I(T,y^) 


(19) 


The  determination  of  the  strength  and  direction  of  the  inci- 
dent beams  is  facilitated  by  taking  the  derivative  of  the  con- 
stitutive relation 


dl(x,y  ) 

- 2 *i 


dx 


(20) 


Sxibstituting  from  Eqs.  (10)  and  (13)^  we  can  write 


-K  X 


_ ^ IVi 

1-)  _KLe 

= 1 a a 2 i 


i-r  - ° ■ 

^a=lal+Ky. 


L e 


-K  X 
a 


(21) 


a 1 
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1 

The  constant  terms  are  equal  in  the  conservative  case  I - 1. 
Interchanging  the  order  of  summation  leads  to 


-K  T 


-K  T 


m - 1 „ . a vn  - 1 „ , a 1 v i 

/ KLe  “A  _KLe  oX  1 T 

^a  = lacx  ^a  = laa  2^.1+K 

1 

which  in  turn  requires 


K p. 
a 1 


(22) 


1 yn 

2 


yn 

= 1 


2 = 1 


1 + K M. 
a 1 


^-1 


1 - K y. 
a 1 


= r" 

ii  = 1 


1 - K^y? 

a 1 


= 1 


(23) 


where  we  have  assumed  y . = - y.  and  il;  . = ip.  . 

-1  X ^-x  X 

We  shall  now  demonstrate  that  is  the  residue  of  reciprocal 
H-functions.  Consider  the  bilaterally  symmetric  T-function 
defined  by 


T(1A)  E 


H(1A)H(-1/k) 


^ n = 1 
K^n 

a = 1 

k2  - k2 

= 1 

2 1 

9 

= 1 + 


= 1 


y?K^  - 


(24) 


Clearly,  we  have  for  k = ± 

T(±  1/K  ) =0  = 1-^"  (25) 

“ ^ ^ 1 - K2y?  , 

a X 

which  shows  that  the  T-function  satisfies  the  requirement  of 
Eq.  (23) . 

The  strength  of  the  beam  at  y = y^  is  given  now  as  the 
residue  of  the  reciprocal  H-functions 
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I 


, lim  . 1 

♦i  ■ " — ^ 


- 


P? 


T(l/ic) 


= K 


lim 


^ (k^  - l/p^) 

vi7  k2h(i/k)h(-  1/k) 


k!) 


^ - 1/Pj) 

i 


(26) 


The  deep  reciprocity  inherent  in  the  codon  formulation  of 
the  radiative  transfer  problem  is  seen  by  comparing  and 
The  coefficients  of  the  exponential  source  function  expansion, 
as  defined  in  Eg.  (13) , namely, 


J(t)  = const 


T + Q + 


rn  - 1 
= 1 


L e 
a 


-K  T>i 
a 


are  identified  as  residues  at  the  poles  of  the  H-function  codon, 
from  Eg.  (6) , namely. 


H L 
o a 


lim 

K 


K 

a 


(k  + K ) 
a 


H(1/k) 

K 


tH 


= 


] 

a 3 = 1 
^ a 


(27) 


In  contrast,  the  weights  of  the  filter  function  which  enter 
the  constitutive  relation,  from  Eg.  (19) , namely, 

J(t)  = j I(T,y^)  , 


are  residues  at  the  poles  of  the  reciprocal  paired  H-function 
codons 


♦ l = K ± ^ 


.2  _ 


1/y? 


^ 

a = 1 


(1/yt 


2 - k2) 

a 


k2h(1/k)H(-1/k) 


^ - 1/y?) 


/ i 


It  is  further  seen  that  a knowledge  of  the  codon  structure,  i.e., 
its  roots  and  poles,  serves  to  specify  all  the  radiation  functions 
and  parameters  of  the  problem.  The  codon  concept  shifts  the 
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emphasis  and  changes  the  character  of  transfer  theory.  Rather 

than  seeking  the  solution  for  a particular  radiation  parameter, 

one  develops  algorithms  for  determining  the  underlying  codon 

structure  from  the  given  data.  Thus,  in  the  forward  problem  in 

which  the  internal  scattering  characteristic  function  is 

known,  Gaussian  quadrature  is  used  to  determine  the  weights 

and  directions  ]j  . . These  determine,  in  turn,  the  constants  K and 
1 a 

and  thus  the  upwelling  intensity.  On  the  other  hand,  in  the 

inverse  problem  we  must  construct  the  codon  structiire  from  the 

observed  upwelling  intensity  profile.  This  requires  a nonlinear 

matrix  inversion  algorithm,  developed  by  the  author  (Ref.  1), 

which  solves  uniquely  and  exactly  the  following  problem;  Given 

2n  upwelling  intensities  I.  = 1(0,  |i.)  sensed  at  the  arbitrary 

-1  ^ ^ 

nadir  angles  0^  = cos  Uj  (j  = 0,1,...,  2n  - 1) , find  the  unique 
2n  constants  C,  Q,  and  the  n - 1 pairs  ^ which  are  speci- 

fied by  the  measurements.  This  is  equivalent  to  the  inversion  of 
the  following  nonlinear  equation  set 


I . 
D 


+ Q + 


yn  - 1 
= 1 


1 + K y. 
a j 


j = 0,1, 


2n  - 1 (28) 


The  Planck  intensity  source  function  follows  readily  from  Eq.  (13); 
namely. 


B(t) 


-K 


T + Q + 


vn  - 1 ^ a 
) ^ L e 

= 1 a 


(29) 


The  inference  of  the  internal  scattering  parameters  becomes  a 

mere  evaluation  of  y . and  . from  the  inferred  constants  L , K . 

11  a a 


III.  SUMMARY,  GENERALIZATION,  AND  PROSPECTUS 

In  this  paper,  we  have  discovered  three  features:  first, 

that  underlying  the  interaction  of  radiation  and  matter,  i.e., 
radiative  transfer  theory,  is  a simple  code;  second,  that  the 
code  consists  of  linear  arrays  of  roots  and  poles  along  an  axis 
in  a complex  transform  space  (which  we  have  called  "codons");  and 
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third,  that  the  code  translates  easily  into  observable  and  infer- 
able radiation  parameters.  Stated  succinctly,  we  have  found  a 
code,  we  have  determined  its  structure,  and  we  have  broken  the 
code. 

In  assessing  the  importance  and  implications  of  this  work, 
we  must  set  forth  how  transfer  theory  differs  from  conventional 
differential  analysis.  The  specification  of  density  distributions 
is  of  great  interest  in  physics,  for  example,  the  fluid  density 
in  the  Euler-Lagrange  equation  or  the  i|;-probability  density  in 
Schrddinger  wave  mechanics.  In  these  and  other  cases,  the  den- 
sities, specified  as  solutions  of  partial  differential  equations, 
are  considered  primal  and,  hence,  not  further  analyzable  into  more 
basic  component  parts.  In  contrast,  the  transfer  equation  through 
its  linkage  between  two  spaces  relates  the  source  function,  and 
energy  density,  to  the  solution  of  an  integro-dif ferential 
equation.  In  this  relation,  the  density  J(t)  is  analyzed  and 
dissected  as  the  discrete  sum  of  a more  primitive  concept,  the 
beam  field  I(x;  M,K)  quantized  in  the  directions  and  e-folding 
lengths 

We  have  seen  that  the  physics  of  the  generalized  transfer 
problem  is  completely  determined  by  the  root-pole  morphology  of 
the  codon.  Further,  the  alternate  root-pole  structure  along  the 
negative  real  axis  of  the  H-codon  in  classical  theory  is  a rela- 
tively restricted  grouping.  It  is  natural  to  inquire  into  the 
physical  systems  implied  by  more  general  codon  patterns. 

We  find  that  there  are  codon  identities  other  than  the 
linear  expression,  Eq.  (8) , which  can  generate  transfer  equations. 
In  particular,  the  quadratic  identity  can  be  used  to  generate  a 
wave  transfer  equation.  The  Planck  intensity,  derived  from  first 
principles,  is  such  an  example-  We  may  ask  what  is  the  condition 
for  the  source  function  to  obey  a differential  equation?  We  find 
this  occurs  if , and  only  if,  the  associated  codon  exhibits  the  group 
property  of  invariance  under  displacement.  This,  in  turn,  occurs 
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if,  and  only  if,  the  codon  roots  and  poles  are  ecjually  spaced-  We 
are  able  to  express  all  the  finite  polynomials  (Legendre,  Laguerre, 
Hermite,  Jacobi)  entering  into  the  ^-functions  of  simple  quantum 
systems  as  inverse  of  such  equally  spaced  (canonical)  codons 
(Ref.  2).  Finally,  it  proves  possible  to  infer  the  relativistic 
radial  (Dirac)  wave  equation  as  the  inverse  of  a split  quadratic 
codon  equation. 
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SYMBOLS 

Planck  intensity  source  function 
H-function  residue  at  origin 

discrete  ordinate  H-f unction  of  Chandrasekhar 

radiant  intensity  at  optical  depth  t at  an  angle 
^ -1 

0 = cos  y with  the  zenith 
source  function 

reciprocal  e-folding  depth  of  radiant  beam 
coupling  constant  between  the  forcing  function  and  the 
medium  modes 

Q = q(") / where  q(x)  is  the  Hopf  q-constant 
discrete  ordinate  T-function  of  Chandrasekhar 
dummy  variables  specifying  particular  values  of  K and 
y , respectively 

independent  variable  specifying  the  complex  trcinsform 
plane 

characteristic  function 
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DISCUSSIONS 


Green:  You  used  quantum  mechanics  as  an  illustration  for  your 

problem.  In  most  applications  of  quantum  mechanics/  the 
Schrodinger  equation  or  the  Dirac  equation  does  not  serve  as  the 
goal  of  the  search.  For  example,  in  the  nuclear  force  problem, 
you  can  assume  that  the  Schrodinger  equation  or  the  Dirac  equation 
is  a basic  law  of  nature  or  the  basic  equation  of  motion  and  you 
might  be  attempting  from  scattering  data,  experimental  accelerator 
data,  to  infer  a basic  law  force  so  that  the  Schrodinger  equation 
or  the  Dirac  equation  has  the  same  role  as  the  radiative*  transfer 
equation  and  the  scattering  data  has  the  role  of  some  experimental 
observation  from  which  you  hope  to  find  a basic  nuclear  force  or 
atomic  force.  Thus,  it  seems  that  if  in  your  analogy  of  your  codons 
you  arrive  at  the  Schrodinger  equation  or  the  Dirac  equation  you 
do  not  have  a parallelism  to  the  real  world  use  of  the  Schrodinger 
equation  or  the  Dirac  equation  as  something  which  connects  scat- 
tering observations  to  some  physical  aspect  of  the  description  of 
your  system.  Which  is  the  law  of  force  governing  your  system 
which  you  would  then  insert  into  the  Schrodinger  equation? 

King:  Let  me  try  to  answer  that.  Perhaps  our  conventional  view 

of  the  Schrodinger  and  Dirac  equations  needs  reassessing.  Let  us 
think  generally.  A codon  identity  quite  literally  is  a separation 
of  unity  into  two  or  more  parts.  The  radiative  transfer  equation 
is  the  transcription  into  our  space  of  this  codon  separation.  The 
radiative  heat  exchange  represents  the  flow  resulting  from  this 
nonequilibrium  partition  arising  from  the  nonisothermal  temperature 
distribution.  The  Schrodinger  and  Dirac  ecjuations  may  be  similarly 
viewed  as  representing  unequal  partitioning  of  the  Psi-function, 
concentrated  in  regions  of  high  electron  expectancy  and  dilute 
elsewhere.  At  a deeper  level,  this  separation  is  represented  in 
the  Dirac  equation  by  a triply  split  codon  identity.  This  identi- 
fication of  the  Dirac  equation  as  an  inverse  codon  identity  has 
the  important  conceptual  consequence  that  nature,  at  least  in  the 
radial  Dirac  equation  describing  the  Kepler  atom,  is  contrained  by 
a law  of  form,  rather  than  a law  of  force - 

Unidentified  Speaker:  Could  you  predict  something  that  isn't 

known? 

King:  Yes.  At  present  no  theory  or  model  exists  for  the  fine 

structure  constant.  It  currently  enters  into  field  theory  as  an 
ad  hoc,  empirically  determined,  externally  imposed  coupling  con- 
stant between  the  electromagnetic  field  and  the  electron.  I hope 
to  model  the  fine  structure  constant  as  a codon  counting  algorithm 
involving  the  harmonic  sum  of  codon  poles. 

Irvine:  I am  a little  bit  perplexed  at  your  unhappiness  with  the 

transfer  equation,  Jean.  You  say  there  are  ambiguities  in  it,  but 
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that  is  true  of  all  physics.  In  every  physical  equation  we  are 
making  a mathematical  approximation  to  reality.  The  best  des- 
cription of  radiation  we  think  we  have  now  would  be  quantxim  field 
theory.  Of  course,  there  are  aspects  of  that  that  are  not  present 
in  Maxwell's  equations.  Likewise,  in  the  transfer  equation,  we 
are  negelcting  information  which  is  present  in  Maxwell's  equations. 

King:  Yes,  I agree  with  you.  What  I should  have  said  was  that  a 

linear  transfer  equation  is  satisfied  only  by  exponential -type 
source  functions.  In  other  words,  only  restricted  classes  of 
source  functions  satisfy  the  linear  transfer  equation.  We  agree 
to  that. 

Jrvine:  You  also  expressed  unhappiness  about  the  fact  that  one 

uses  a local  thermodynamic  equilibrium  approximation  for  the  source 
function.  Of  course,  that  is  not  necessary.  That  is  simply 
because  we  are  ignoring  any  possible  information  on  the  micro- 
physics at  a given  point. 

Twitty:  I guess  I am  either  missing  something  or  I don't  under- 

stand this  relationship  between  the  codons  and  the  equations  you 
have  derived.  There  are  an  infinite  set  of  equations  you  could 
write  down  that  are  identities,  and  one  could  do  these  kind  of 
transforms  on  all  of  those  and  produce  some  infinite  set  of  dif- 
ferential equations  which  hopefully  would  not  describe  anything 
we  know  about  physics.  What  is  so  special  about  the  ones  you  have 
derived? 

King:  The  special  feature  is  that  the  transfer  equation  is  the 

only  inverse  statement  of  a linear  codon  identity.  And  I would 
say  that  quadratic  codon  identities  lead  to  sinusoidal  source 
functions  and  quadratic  transfer  equations. 

Twitty:  But  then  the  Dirac  equation  looks  far  more  complicated. 

King:  It  is  still  a wave  equation. 

Twitty:  No,  I am  not  referring  to  the  Dirac  equation.  Your 

identity  that  gives  it  is  far  more  complicated. 

King:  Yes,  the  reason  for  that  is  the  additional  constraint  which 

codons  must  obey  in  quantum  mechanics.  The  differential  equation 
format  of  wave  mechanics  demands  that  the  roots  and  poles  of  the 
corresponding  codons  in  the  transform  plane  be  equally  spaced. 

These  codon  patterns  associated  with  various  quantum  systems  are 
developed  in  the  paper  cited. 

Twitty:  So  it  seems  to  me  that  somewhere  in  this  identity,  mathe- 

matical identity  is  what  you're  really  talking  about,  there  is  the 
basic  physics  and  you  have  used  some  additional  knowledge  about 
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the  physics  to  say  that  this  identity  corresponds  to  what  we  know 
in  terms  of  the  wave  equation  describing  the  problem. 

King:  Yes.  That  requirement  is  that  the  Psi-function  codon  be 

invariant  under  translation  in  the  transform  plane.  This  invari- 
ance r in  turn,  demands  equally  spaced  roots  and  poles,  namely, 
canonical  codons,  for  superposition - 

Fymat:  I have  a question  and  a comment.  The  question  is:  Have 

you  carried  on  this  work  for  the  case  of  scattering  and  with  con- 
tributions from  single  and  multiple  scattering  in  the  source 
function? 

King:  No,  I have  not.  The  formalism  appears  indifferent  to 

whether  or  not  the  impinging  photon  is  singly-  or  multiply-scattered. 
Perhaps  this  is  not  a disadvantage  inasmuch  as  it  is  difficult  to 
conceive  any  measurement  which  would  discriminate  between  the  two. 

Fymat^ : Tlie  comment  is  that  in  his  book.  Professor  van  de  Hulst 

has  emphasized  that  we  should  use  what  he  called  the  scattering 
amplitude  matrix,  a 2 x 2 matrix  which  contains  information  both 
on  the  amplitude  and  the  phase  of  the  wave.  He  carried  the  theory 
only  for  single  scattering.  On  the  other  hand,  Fano  has  shown  the 
analogy  between  this  treatment  and  quant™  mechanics.  Dr.  Vasudevan 
and  myself  have  pursued  this  analogy  further  in  a paper  we  have 
recently  published. 2 This  article  provides  the  complete  theory  of 
multiple  scattering  for  both  amplitude  and  phase  by  exploiting  the 
close  analogy  between  the  quantum  mechanical  states  of  half-spin 
systems  and  the  polarization  states  of  electromagnetic  radiation. 

The  interest  of  this  new  form^iiation  is  that  it  enables  you  to 
carry  the  multiple  scattering  with  the  phase  information.  Although 
we  are  not  observing  the  phase  in  the  visible,  this  is  possible  in 
other  regions  of  the  spectrum.  The  phase  also  contains 


1 

Dr.  Fymat 's  post-Workshop  comment:  "I  did  not  say  that  the 
scatterer  molecule  or  particle  conserved  any  memory  of  the  scat- 
tering order.  Rather,  I was  concerned  with  the  formalism  that 
your  codon  approach  would  take  had  you  considered  a particular 
scattering  process  and  the  consequent  polarization  it  induces  in 
the  radiation  field.  Here,  the  radiative  transfer  equation  would 
become  four-dimensional  (using,  for  example,  Stoke^ ' representation 
of  the  polarization  state) , and  the  source-function  would  be  far 
more  complicated  than  Planck's  function  as  it  wbuil  receive  con- 
tributions from  both  single  and  multiple  scattering." 

2 

Published  in  Astrophysics  and  Space  Science,  Vol.  38, 
pp.  95-124,  1975. 
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information  about  the  atmosphere.  I thought  you  might  wish  to  use 
this  work  were  you  interested  in  extending  your  codon  approach  to 
scattering  and  polarization. 

King:  Yes,  I see.  All  of  these  methods  must  ultimately  rest  on 

some  conservation  principle. 
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BACKUS "GILBERT  THEORY  AND  ITS  APPLICATION 


TO  RETRIEVAL  OF  OZONE  AND 
TEMPERATURE  PROFILES 


Barney  J . Conrath 
Goddard  Space  Flight  Center 


The  analytical  methods  of  Backus  and  Gilbert  were  orig- 
inally formulated  for  application  to  inverse  problems  asso- 
ciated with  the  physics  of  the  solid  earth • However,  the 
theory  is  sufficiently  general  to  be  applicable  to  many  types 
of  inverse  problems,  and,  in  particular , constitutes  a use- 
ful tool  for  analyzing  the  information  content  of  atmospheric 
profile  retrievals , Basically , the  method  provides  a quanti- 
tative evaluation  of  the  trade-off  between  vertical  resolution 
of  a retrieved  profile  and  formal  root-mean-square  (rms)  error 
due  to  measurement  noise  propagation.  As  one  example  of  an 
application  of  the  theory,  the  problem  of  retrieving  the  top- 
side ozone  profile  from  backscattered  ultraviolet  (BUV) 
measurements  is  considered . For  measurements  of  the  type 
currently  being  obtained  with  the  Nimbus  4 and  AE-E  BUV  experi- 
ments, it  is  found  that  a vertical  resolution  of  approximately 
0,75  scale  height  can  be  achieved  for  a formal  volume  mixing 
ratio  profile  error  of  10%,  Other  examples  include  treatments 
of  the  retrieval  of  temperature  profiles  from  measurements  in 
the  15  \im  CO2  absorption  band  for  both  the  terrestrial  and 
Martian  atmospheres.  Finally , the  method  is  applied  to  the 
problem  of  retrieving  temperature  profiles  of  the  Jovian 
planets  from  measurements  in  the  far  infrared  pressure  induced 
lines  to  be  obtained  from  the  Mariner  Jupiter/Saturn  fly-by 
missions , In  the  latter  example,  the  results  of  the  Backus- 
Gilbert  analysis  are  compared  with  an  analysis  by  Gautier  and 
Revah,  based  on  more  conventional  information  theory  tech- 
niques. 
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I.  INTRODUCTION 


The  method  of  Backus  and  Gilbert  was  originally  developed  for 
application  to  inverse  problems  encountered  in  the  physics  of  the 
solid  earth.  The  theory  was  formulated  primarily  from  a physical 
point  of  view  in  a series  of  three  papers  by  Backus  and  Gilbert 
(Refs-  1,  2,  and  3)  and  summarized  within  a more  formal  framework  by 
Backus  (Ref.  4).  In  addition  to  providing  an  inversion  algorithm, 
the  method  also  provides  diagnostic  information  which  can  be  used 
in  assessing  the  value  of  a given  set  of  measurements.  Although  the 
original  applications  were  in  the  field  of  seismology,  the  theory 
is  quite  general  and  can  be  readily  applied  to  inversion  problems 
encountered  in  atmospheric  physics.  Examples  of  such  applications 
include  those  of  Conrath  (Ref.  5),  Westwater  and  Cohen  (Ref.  6), 
Fleming  (Ref.  7),  Wang  (Ref.  8),  and  Rodgers  (Ref.  9). 

In  the  present  paper,  a review  of  the  method  as  applied  to 
profile  retrieval  in  planetary  atmospheres  is  given.  The  basic 
theory  is  discussed,  and  certain  aspects  are  illustrated  through 
the  use  of  examples.  This  review  will  be  followed  by  the  presen- 
tation of  results  of  a recent  application  of  the  method  to  the 
problem  of  retrieving  high  level  ozone  profiles  from  satellite 
measurements  of  back-scattered  ultraviolet  radiation.  Finally, 
applications  to  problems  of  temperature  profile  retrieval  from 
remote  infrared  measurements  of  the  earth's  atmosphere  as  well  as 
the  atmospheres  of  Mars,  Jupiter,  and  Uranus  are  considered.  In 
the  case  of  Jupiter,  results  are  compared  with  those  obtained  by 
Gautier  and  Revah  (Ref.  10)  who  employed  a different  theoretical 
approach . 

II . THEORY 

Basically,  the  method  of  Backus  and  Gilbett  treats  a general 
set  of  integral  equations  of  the  form 

gi  = / Ki(z)  f (z)  dz  i = 1,  2,  ...  m (1) 


156 


ir 


It  is  assumed  that  there  exist  measurements  of  the  quantities 
and  that  the  K^(z)  are  known  functions  of  the  independent  variable 
z.  The  problem  is  to  infer  information  on  the  unknown  function 
f(z).  In  the  applications  considered  in  this  paper,  the  are 
generally  related  to  radiance  measurements  the  kernels 
are  determined  by  the  radiative  transfer  process  (scattering, 
absorption,  etc.)f  f (z)  is  an  atmospheric  profile,  and  z is  some 
measure  of  height  within  the  atmosphere. 

The  finite  number  of  kernels  K^(z)  do  not  constitute  a com- 
plete set,  in  general,  so  it  is  not  possible  to  obtain  an  exact 
specification  of  f (z)  from  measurements  of  g^.  Nevertheless,  it 
still  may  be  possible  to  specify  certain  useful  properties  of  f(z) 
from  the  available  measurements.  Let  f(z)  be  an  integral  pro- 
perty of  the  profile  associated  with  level  z,  and  assume  it  is  to 
be  obtained  by  a linear  estimate  of  the  form 
m 

f(z)  = I a (z)g  (2) 

i = 1 

where  the  z-dependent  coefficients  a^(z)  are  determined  by  the 
inversion  method  chosen.  A relation  between  f(z)  and  f(z)  is 
obtained  by  substituting  Eq.  (1)  into  Eq.  (2),  i.e., 

f(z)  = / A(z,  z')  f (z')dz'  (3) 

where 

m 

A(z,  z*)  = ^ a (z)K.(z*)  (4) 

i = 1 ^ 

Thus,  the  nature  of  the  estimate  f{z)  is  controlled  by  the  behavior 
of  A(z,  z*),  usually  called  the  averaging  kernel.  The  essence  of 
the  Backus-Gilbert  method  is  to  attempt  to  control  the  shape  of 
A(z,  z')  through  the  choice  of  the  coefficients  a^(z).  Originally, 
effort  was  directed  primarily  toward  making  A(z,  z')  resemble  a 
delta  function  as  nearly  as  a given  set  K^(z)  would  permit.  In 
other  words,  the  goal  was  to  achieve  in  some  sense  the  best 
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resolution  possible.  However,  there  are  some  applications  for 
which  it  is  desirable  to  force  A(z,  z*)  toward  some  other  pre- 
detemined  shape.  For  example,  it  may  be  required  that  f(z) 
approximate  a uniformly  weighted  average  of  f(z)  oyer  some  range 
of  the  independent  variable  centered  on  z,  in  which  case  A(z,  z*) 
should  be  made  to  approximate  a rectangular  function.  In  any  event, 
one  approach  to  controlling  the  shape  of  A(z,  z')  is  to  choose  the 
coefficients  a^(z)  such  that  they  minimize  some  quadratic  form  for 
each  value  of  z,  for  example^ 

Q(z)  = / J(z,  z')  [A(z,  z')  - D(z,  z')]^  dz'  (5) 

where  the  weight  J(z,  z*)  and  the  function  D(z,  z*)  are  chosen  to 
produce  the  desired  behavior  in  A(z,  z*). 

Another  aspect  of  the  problem  which  must  be  considered  is 
measurement  noise.  Any  measurements  of  g^  will  have  associated 
with  them  errors  of  unknown  magnitude.  However,  if  the  statistical 
properties  of  the  measurement  errors  are  known,  the  resulting  sta- 
tistical properties  of  the  errors  in  f (z)  can  be  found.  By  assuming 

that  the  measurements  possess  zero  mean  error  and  have  an  error 

2 

covariance  matrix  E,  it  is  easily  shown  that  a-,  the  error  vari- 
ance  in  f,  is  given  by 

(z)  = a'^(z)  E a (z)  (6) 

f ^ 

where  a (z)  is  the  column  vector  of  coefficients  a^(z),  and  the 

superscript  T denotes  matrix  transposition.  From  the  point  of 

view  of  controlling  the  propagation  of  measurement  noise,  it  is 

2 

desirable  to  choose  the  coefficients  a^(z)  such  that  a^(z)  is 

minimized  at  each  level  z.  However,  it  is  not  possible,  in  general, 

2 

to  minimize  both  Q(z)  and  a^(z)  simultaneously;  therefore,  a com- 
promise is  reached  by  minimizing  a linear  combination.  This 
relationship  can  be  written  as 

R(z)  = wQ(z)  + (1  - w)  ra|(z)  (7) 
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where  the  factor  r ensiares  that  both  terms  have  the  same  physical 
dimensions.  By  varying  the  weight  w,  emphasis  can  be  shifted  from 
minimizing  the  error  to  maximizing  control  over  the  shape  of 
A(z,  z*)-  Thus,  there  is  a tradeoff  between  the  two  considerations, 
and  the  best  choice  for  w must  be  determined  by  the  nature  of  the 
particular  application. 

The  value  of  a(z)  which  minimizes  R(z)  can  be  readily  calcu- 
lated; as  a result,  an  expression  for  f(z)  of  this  form  is  obtained 


/V  T T 

f (z)  = a g = V (z) 


where 


• 

f 

1 - w 

** 

s (z)  + 

r E 

w 

m 

V^(z)  = /r^(z')d(z,  z')J(z,  z')dz' 


(8) 


(9) 


and 

s^^(z)  = / K^(z')Kj  (z’)  J(z,  z')dz'  (10) 

An  interesting  special  case  of  this  form  of  solution  results  when 
J(z,  z*)  = 1 for  all  z and  z* , and  D(z,  z’)  = 6(z  - z*).  Then 

s..  = / K.(z)K.(z)dz  (11) 

ID  1 J 

and 


V^(z)  = K^(z) 


(12) 


Thus,  Eq.  (8)  has  the  form  of  the  "minimum  information"  solution 
(Ref.  11)  for  the  continuous  case 


f(z)  = 


T 

K^(z) 


'/k 


T 


K(z*)K  (z*)dz' 


where 


Y 


r 


(13) 


Thus,  viewed  from  within  the  framework  of  Backus-Gilbert  theory, 
the  minimum  information  solution  is  that  solution  for  which  the 
averaging  kernel  lies  closest  to  a delta  function  subject  to  a 
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constraint  on  error  propagation  controlled  by  the  value  of  y. 

Note  that  for  the  general  form  of  Eq.  (8) , the  matrix  to  be 
inverted  is  a function  of  z so  a matrix  inversion  is  required  for 
each  value  of  z considered.  However,  in  the  special  case  when 
J(z,  z*)  is  independent  of  z,  a single  matrix  inversion  is 
required  for  all  values  of  z.  This  may  be  a nontrivial  point  of 
consideration  when  an  inversion  method  is  being  chosen  for  pro- 
cessing a large  quantity  of  data. 

A second  form  of  Q(z)  which  has  been  used  extensively  is  that 
obtained  with 

D(z,  z*)  = 6 (z  - z*)  and  J(z,  z*)  = 12 (z  - z*)^ 


This  form  generally  results  in  an  averaging  kernel  with  a broader 
central  peak  but  smaller  sidelobes  compared  with  the  averaging  ker- 
nel obtained  by  using  J(z,  z*)  = 1.  This  choice  from  Eq.  (5) 
yields 

Q(z)  = s(z)  = 12  / (z  - z*)^A^(z,  z*)dz*  (14) 


where  s(z)  is  called  the  "spread."  It  has  units  of  z and  is  a 
measure  of  the  spread  of  A(z,  z*)  about  z*  = z.  The  unusual 
normalizing  factor  12  is  chosen  so  a rectangular  A of  unit  area 
has  a value  of  s equal  to  its  width.  Although  this  choice  is  quite 
arbitrary,  it  can  be  demonstrated  that  s is  'a  good  approximation 
to  the  usual  measures  of  the  widths  of  other  well-known  functions. 
Some  examples  are : 

(a)  Gaussian 


A(Z) 

S 


1 

/2Tr'b 

3 


- z^/2h^ 

: e 

X 2b  = 0.85  X 2b 


(b)  Triangular 


A(z) 


!(1  - z/2b)/2b 
0. 


|z|  < 2b 
|z|  > 2b 
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s = 1.2  X 2b 
(c)  Lorentzian 

A(z)  = 

7T 

s = - X 2b  = 0,95  X 2b 

TT 

where  b is  the  half-width  at  ha If -maximum.  Thus,  it  appears  that 
s can  be  taken  as  a reasonable  though  somewhat  unconventional 
measure  of  the  resolution  in  z. 

As  before,  a is  determined  by  minimizing  a linear  combination 
2 

of  s and  Og ; however,  in  this  case,  it  is  necessary  to  impose  an 
additional  constraint  to  obtain  a nontrivial  solution-  The  con- 
straint chosen  by  Backus  and  Gilbert  is  that  A(z,  z*)  be  unimodu- 
lar,  i. e- , 

/ A(z,  z')dz'  = 1 (15) 

This  is  a reasonable  choice  if  f (z)  is  interpreted  as  some  average 
value  of  f-  However,  it  should  be  noted  that,  in  general, 

A(z,  z*)  can  be  negative  for  some  values  of  z * thus,  the  analog 
with  a weighted  average  is  not  complete  unless  one  is  willing  to 
admit  negative  weights.  Carrying  out  the  minimization  process  yields 
W“  ^ ( z ) u 

a(z)  = -J— ~1 ^ (16) 

u W (z)u 

where 


W(z)  = w s(z)  + (1  - w)  r E 

'\J  f\j  <0 


(17) 


= / K^(z)dz 


(18) 


and 

= 12  / (z  - z')2K^(z')Kj {z')dz' 


(19) 
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Thus,  by  varying  the  weight  w a tradeoff  between  error  and  resolu- 
tion as  measured  by  s can  be  obtained.  If  s is  plotted  against 
Cg  as  w varies  from  0 to  1,  a "tradeoff  curve"  is  obtained  which 
can  be  used  to  pick  the  appropriate  value  of  w for  the  application 
at  hand.  One  such  tradeoff  curve  is  obtained  for  each  value  of  z 
considered.  Examples  are  given  in  the  following  sections. 

Other  parameters  can  be  defined  which  are  useful  in  charac- 
terizing the  behavior  of  A(z,  z*).  One  such  parameter  is  the 
"center"  defined  as 

c(z)  = / z'A^(z,z')dzy^J  A^(z,  z')dz*  (20) 

A "resolving  length"  can  then  be  defined  as  the  spread  about  the 
center 

%{z)  = 12  / [c(z)  - z*]^  A^(z,  z*)dz  (21) 

Obviously,  if  f(z)  is  to  represent  a weighted  average  of  f over  a 
region  centered  on  z,  then  we  would  like  to  have  c(z)  - z.  Note 
that  the  spread  also  can  be  written  as 

s(z)  = it(z)  + 12[z  - c(z)]^  / A^(z,  z*)dz*  (22) 

Thus,  s(z)  has  contributions  due  both  to  the  width  of  A(z,  z*)  and 
the  displacement  of  its  center  from  z.  Therefori^,  minimizing  s(z) 
has  the  desirable  property  of  reducing  both  the  width  of  A and  the 
departure  of  its  center  from  the  value  of  z being  considered. 


It  is  of  interest  to  note  that  the  minimiom  information  form 
(Eq.  (13))  does  not,  in  general,  result  in  a unimodular  averaging 
kernel.  However,  the  derivation  leading  to  Eq.  (13)  can  be  modi- 
fied to  incorporate  a unimodular  constraint,  resulting  in  the 
solution 


f (z) 


K^(z)  + 


T -1 
1 - K (z)o3  u 

T -1 

U 0)  u 

- Oj 


(23) 


where 

^ + (1/w  - 1)  r jg 
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III.  APPLICATION  TO  OZONE  PROFILE  RETRIEVAL 

The  problem  of  retrieving  information  on  the  vertical  dis- 
tribution of  ozone  within  the  earth  * s atmosphere  from  measurements 
of  backscattered  ultraviolet  radiance  using  a satellite  borne 
sensor  can  be  analyzed  from  the  point  of  view  of  Backus-Gilbert 
theory.  Although  a considerable  body  of  literature  on  the  treat- 
ment of  this  type  of  inversion  problem  exists  (e.g.,Refs.  12,  13,  and 
14),  no  attempt  will  be  made  here  to  review  the  various  methods. 
Rather,  we  shall  be  concerned  with  more  general  questions  con- 
cerning the  information  content  of  the  measurements - 

The  extraction  of  ozone  information  from  backscattered  radi- 
ance' measurements  is  in  principle  a straightforward  process.  The 
incident  solar  radiation  is  scattered  back  to  the  satellite  sensor 
from  various  levels  within  the  atmosphere  and  from  the  lower 
boundary  surface.  In  addition,  if  the  measurement  is  made  within 
an  ozone  absorption  band,  the  radiation  is  attenuated  by  absorption 
along  the  total  path.  To  a first  approximation,  the  majority  of 
the  radiation  is  backscattered  from  an  effective  scattering  layer. 

If  the  distribution  of  scatterers  (atmospheric  molecules  and 
aerosols)  is  assumed  known,  then  a measurement  of  the  ratio  of  the 
backscattered  radiance  to  the  incident  solar  flux  permits  the 
attenuation  due  to  ozone  absorption  to  be  inferred.  From  a knowl- 
edge of  the  ozone  absorption  coefficient,  the  total  ozone  above 
the  effective  scattering  layer  can  be  inferred-  For  an  estimate 
of  total  ozone,  the  effective  scattering  layer  should  be  located 
in  the  troposphere.  To  obtain  profile  information,  measurements 
at  several  different  wavelengths  are  required,  corresponding  to 
scattering  layers  covering  a range  of  heights  in  the  stratosphere. 

The  problem  of  extracting  ozone  information  divides  naturally 
into  two  distinct  problems : retrieval  of  high  level  profiles  and 
retrieval  of  total  column  abundance.  Since  the  retrieval  of  total 
column  abundance  does  not  require  a profile  inversion  in  the  usual 
sense,  only  the  upper  level  profile  retrieval  will  be  considered 
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here.  Most  investigations  of  profile  inversion  have  been  limited 
to  the  "topside"  or  region  above  the  ozone  peak  so  that  only 
single  scattering  need  be  considered.  For  example,  the  Nimbus  4 
BUV  data  have  been  used  to  retrieve  profiles  only  above  approxi- 
mately the  10  mb  level - 

For  a single  scattering  Rayleigh  atmosphere,  the  backscattered 
spectral  radiance  measured  by  a nadir  viewing  sensor  at  wave- 
length can  be  written  as 

|-Po  -(1  + cos  0)[B.p  + k X(p)] 

I..  = VTT  (1  + cos^0)3.F.  e dp 

1 IbTT  1 X 

0 


where  F^^ is  the  solar  spectral  flux,  6 is  the  solar  zenith  angle, 

3^  is  the  scattering  extinction  coefficient,  k^  is  the  absorption 
coefficient,  X(p)  is  total  ozone  (cm-kPa)  above  pressure  level  p, 
and  p^  is  lower  boundary  pressure  level.  It  is  assumed  that  the 
integrand  in  Eq.  (1)  approaches  zero  as  p p^.  Given  measure- 
ments of  retrieval  problem  then  is  to  solve  the  integral 

equation  for  X(p) . 

In  forming  a linear  inversion  problem,  most  methods  consider 
perturbations  about  some  starting  profile  or  first  guess  X*^(p). 

For  this  discussion,  the  atmosphere  is^  divided  into  discrete  layers 
of  uniform  thickness  in  £n p . Further,  because  of  the  large  range 
of  variation  of  I with  A,  a logrithmic  scaling  is  found  convenient. 
Then  the  deviation  of  iln  I.  from  the  value  it  would  have  for  an 

— X 

o % 

ozone  profile  X (p)  can  be  written  to  first  order 

3 £n  I^ 

* ’•"'i  ■ ? siHir  * “"■‘j 

J D 


where  x.  is  the  amount  of  ozone  in  the  jth  layer.  The  partial 
derivatives  are  evaluated  at  X = X^(p)  with  the  integral  written 
in  numerical  quadrature  form  and  provide  a measure  of  the  sensi- 
tivity of  the  radiance  at  the  ith  wavelength  to  changes  in  the 
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ozone  content  in  the  jth  layer.  In  the  notation  of  Section  II,  the 

partial  derivatives  correspond  to  a discrete  form  of  K^(z),  whereas 

6 £nl.  = g.  and  6 Znx,  = f..  A set  of  kernel  functions  for  seven 
11  3 3 

wavelengths  in  the  Hartley-Huggins  band  is  shown  in  Fig.  1.  (These 
kernels  were  provided  by  C.  L.  Mateer.) 

Using  the  methods  of  Section  II  ^ with  the  spread  as  the  param- 
eters characterizing  the  averaging  kernel,  tradeoff  curves  were 
calculated.  An  example  of  a tradeoff  curve  is  shown  in  Fig.  2 for 
the  3.76  mb  level.  In  this  case,  a random  noise  in  the  measurement 
comparable  to  that  achieved  with  the  Nimbus  4 BUV  experiment  (-1%) 


0,  WEIGHTING  FUNCTIONS 


I a In  1/8  lnx| 

Fig.  I.  Kernel  functions  for  topside  ozone  profile  retrieval 
using  hackscattered  ultraviolet  measurements*  Each  curve  Is 
labeled  by  the  wavelength  (Angstroms)  to  which  It  pertains*  (1  bar 
100  kPa*) 
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Fig.  2.  Tradeoff  curve  at  the  3.76  mb  level  for  ozone  pro- 
file retrieval  from  hackscattered  ultraviolet  measurements.  The 
spread  is  used  here  as  a measure  of  vertical  resolution ^ and  an 
rms  measurement  error  of  1%  was  assumed.  (1  bar  = 100  kPa.) 


was  assumed.  The  L-shape  of  the  curve  is  characteristic  of  remote 
sensing  profile  retrieval  techniques  in  general.  If  an  attempt  is 
made  to  improve  the  resolution  as  measured  by  the  spread  much 
beyond  1 scale  height,  the  rms  profile  error  increases  rapidly. 

By  combining  information  from  curves  such  as  these  from  many  dif- 
ferent levels,  the  spread  as  a function  of  height  was  calculated 
for  several  different  rms  errors  (Fig.  3) . If  a 10%  error  is 
accepted,  then  a vertical  resolution  of  one  scale  height  or  better 
can  be  achieved  between  0.5  and  10  mb.  If,  however,  an  error  no 
larger  than  1%  is  demanded,  virtually  no  vertical  structure  infor- 
mation can  be  obtained. 
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VERTICAL  RESOLimON  OF  RETRIEVED  PROFILES 
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Fig,  3,  Spread  as  a function  of  height  for  various  ozone  pro- 
file errors.  An  rms  measurement  error  of  1%  was  assumed,  (1  bar  = 
100  kPa,) 

IV.  APPLICATION  TO  TEMPERATURE  PROFILE  RETRIEVAL 

Backus-Gilbert  theory  has  found  considerable  use  in  the 
analysis  of  problems  associated  with  the  retrieval  of  atmospheric 
temperature  profiles  from  remotely  measured,  thermally  emitted, 
infrared  radiation.  The  method  has  been  applied  with  considerable 
success  to  the  sounding  of  the  terrestrial  atmosphere;  however,  it 
has  proven  to  be  an  especially  useful  tool  in  analyzing  the  poten- 
tial of  various  types  of  measurements  for  sounding  the  atmospheres 
of  other  planets.  In  the  case  of  the  Earth's  atmosphere,  there  is 
usually  information  on  the  temperature  profile  available  in 
addition  to  the  radiance  measurements.  For  planetary  atmospheres. 
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the  radiance  measurements  are  frequently  all  that  is  available 
for  estimating  a profile.  Examples  of  both  types  will  be  con- 
sidered. 


For  purposes  of  this  discussion,  consider  a nonscattering 
atmosphere  in  local  thermodynamic  equilibrium.  Then  the  spectral 
radiance  in  the  ith  spectral  interval  as  measured  with  a sensor 
located  above  the  atmosphere  can  be  written  as 


I. 

1 


B. (T  ) T . (0) 
1 o 1 


+ 


.oo 


9t  (z) 

[T(z)]  

8 z 


dz 


(24) 


In  this  expression,  z = - 5,n(p/p^),  i.e.,  z would  be  the  geometric 
height  expressed  in  units  of  scale  height  for  an  isothermal  atmos- 
phere. Planck  function  for  temperature  T within 

spectral  inverval  i (assumed  to  be  narrow),  and  t^(z)  is  the  atmos- 
pheric transmittance  between  level  z and  the  sensor.  The  term 
Bi(To)T^i(0)  is  the  contribution  from  the  lower  boundary  located  at 
pressure  level  p^  here  assumed  to  be  that  of  a blackbody  at  temper- 
ature T . This  term  is  usually  specified  from  measurements  in 

0 

transparent  spectral  intervals , 

The  problem  then  is  to  use  measurements  of  to  retrieve 
information  on  the  temperature  profile  T(z).  The  problem  is  first 
linearized  by  expansion  about  an  appropriately  chosen  reference 
profile  T^(z) ; this  results  in  the  set  of  linear  integral  equations 
00 

AI.  = / K.(z)AT(z)dz  (i  = 1,  2,  ...  la)  (25) 

1 o ^ » 


where  m is  the  number  of  spectral  intervals  for  which  measurements 

exist.  The  radiance  difference  is  Al.  =1.  - I?  where  I . is  calculated 

111  1 

from  Eq.  (24)  using  the  reference  profile,  and  AT(z)  = T(z)  - T (z) . 
The  kernel  functions  K^(z)  are  given  by 


K^(z) 


d [T^(z)]  9t^(z) 
d T 3z 


(26) 
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These  equations  are  now  of  the  form  of  Eq.  (1)  so  the  methodology 
of  Section  ll  can  be  applied. 

A.  Earth 

Remote  temperature  sounding  in  the  Earth ^s  atmosphere  has 
been  carried  out  using  measurements  within  the  15  pm  and  4 . 3 pm  CO^ 
bands  as  well  as  measurements  in  microwave  lines.  Only  soundings 
within  the  15  pm  band  will  be  considered, 

Conrath  (Ref.  5)  analyzed  measurements  within  the  15  pm  band 
in  terms  of  the  tradeoff  between  averaging  kernel  spread  and  pro- 
file error,  assuming  an  error  covariance  matrix  of  the  form 

E = 1 (27) 

^ e 

where  is  the  variance  of  the  measurement  noise  and  is  assumed 
e 

to  be  the  same  in  all  spectral  intervals  and  1 is  the  unit  matrix. 
With  these  assumptions,  it  was  possible  to  calculate  tradeoff 

Q O 

curves  in  terms  of  spread  as  a function  of  a /a  . Two  cases  were 

considered,  one  for  a set  of  seven  spectral  intervals  and  the  other 

for  a set  of  16  intervals.  Kernel  functions  for  the  seven-interval 

set  are  shown  in  Fig.  4.  Typical  tradeoff  curves  for  the  49-mb 

level  are  shown  in  Fig.  5.  Again,  the  characteristics  L-shape  curves 

result.  Averaging  kernels  from  selected  points  on  the  tradeoff  curve 

for  the  seven-interval  set  (broken  curve  in  Fig.  5)  are  shown  in 

Fig.  6.  The  first  averaging  Icernel  corresponds  to  a point  near  the 

minimum  error-maximum  spread  end  of  the  tradeoff  curve  while  the 

last  is  from  the  maximum  error-minimum  spread  end  of  the  curve.  Note 

that  the  latter  has  a substantially  more  narrow  central  peak  than 

the  former  but  has  developed  sidelobes  with  negative  excursions. 

The  center  as  a function  of  height  calculated  using  Eq.  (20) 

is  shown  in  Fig.  7.  The  same  value  of  o^/o^  was  used  at  all  levels. 

T e 

Ideally,  c(z)  should  lie  on  the  broken  diagonal  line  shown  in  the 
figure.  However,  above  approximately  10  mb,  c(z)  stops  increasing. 
This  level  corresponds  to  the  peak  of  the  uppermost  kernel  function. 
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Fig,  4.  Kernel  functions  for  the  iS-y/n  CO^  band  in  the  ter- 
restrial atmosphere.  The  labels  1,2,6,  7,  8,  10,  and  14  refer 
to  frequencies  667.5  cm  ^ , 677.5  cm  ^ , 697.5  cm  ^ , 702.5  cm  ^ , 
707.5  cm  ^ , 727.5  cm  ^ , and  747.5  cm  ^ , respectively. 

(1  bar  = 100  kPa.) 


and  essentially  no  useful  information  is  obtained  above  that  level. 
The  corresponding  resolving  length  as  a function  of  height  is  given 
in  Fig.  8.  The  values  shown  correspond  to  forroal  rms  temperature 
errors  of  1 to  2 K for  instrument  noise  levels  typical  of  those 
achievable  with  existing  spectrometers  operating  in  the  15-ym  band. 
Thus,  the  resolution  as  measured  by  the  resolving  length  is  approxi- 
mately 0.5  scale  height  in  the  lower  troposphere,  but  degrades  to 
in  excess  of  two  scale  heights  at  10  mb. 
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Fig.  5.  Tradeoff  curves  at  the  49- nib  level  for  temperature 
profile  retrieval  from  measurements  within  the  15~]Am  CO^  band  In 
the  terrestrial  atmosphere.  The  broken  curve  Is  for  a set  of 
seven  spectral  Intervals  while  the  solid  curve  is  for  a sixteen 
Interval  set.  (1  bar  = 100  kPa  ; 1 erg  = 10  J.) 


Fleming  (Ref.  7)  has  considered  the  problem  of  attempting  to 
construct  averaging  kernels  which  are  approximately  rectangular  in 
shape.  This  effort  was  motivated  by  the  fact  that  in  meteorolog- 
ical applications,  the  thickness  of  atmospheric  layers  between 
constant  pressure  surfaces  is  frequently  the  desired  quantity. 
Since  the  thickness  is  proportional  to  the  mean  temperature  of  a 
layer,  a rectangular  averaging  kernel  permits  direct  estimation  of 
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Fig.  7.  Center  as  function  of  scale  height  for  the  seven  spec 
tral  Interval  set  whose  kernel  functions  are  shown  In  Fig.  4. 

(1  bar  = 100  kPa.) 


thicknesses-  For  this  purpose,  D(z,  z*)  in  Eq-  (5)  was  taken  as 
a rectangular  function,  and  the  weight  used  was 

J(z,  z*)  = [1  - D(z,  z')]^  (28) 

This  choice  of  weight  acts  as  a penalty  function  which  tends  to 
suppress  the  amplitude  of  A(z,  z')  outside  the  region  delineated 
by  D(z,  z').  Fleming's  results  showed  that  a slightly  better 
thickness  estimate  could  be  obtained  in  this  way  than  by  integrating 
a retrieved  temperature  profile,  although  the  improvement  was  con- 
sidered marginal. 

Recently,  Rodgers  (Ref.  9)  has  employed  the  Backus-Gilbert 
formulation  to  analyze  the  vertical  resolution  achievable  with 
statistical  estimation  techniques.  The  basic  approach  used  was  to 
treat  the  available  a priori  statistics  as  a measurement  of  the 
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Fig.  8.  Resolving  length  as  a function  of  scale  height  for 
the  seven  spectral  Interval  set  whose  kernel  functions  are  shown 
In  Fig.  4.  The  assumptions  used  In  the  calculations  are  described 
In  the  text.  (1  bar  = 100  kPa,) 


temperature  profile  with  high  vertical  resolution  but  large  error. 
The  results  of  the  study  indicate  that  the  use  of  statistics  along 
with  the  radiance  measurements  considerably  improves  the  vertical 
resolution  over  that  achieved  with  the  radiance  measurements  alone. 
For  example,  by  using  statistics  representative  of  forecast  errors, 
it  was  found  that  an  improvement  in  resolution  by  about  a factor 
of  two  can  be  achieved  at  a profile  error  level  of  0.5  K. 

B . Mars 

The  Mariner  9 spacecraft,  launched  into  orbit  about  Mars  in 
November  1971,  carried  a Michelson  interferometer  operating  in  the 
infrared.  Measurements  within  the  15 -ym  CO^  band  were  used  to 
retrieve  temperature  profiles  on  a near-global  basis.  Although  the 
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average  basal  pressxire  of  the  atmosphere  is  only  about  5 mb,  the 
fact  that  it  is  almost  pure  carbon  dioxide  permitted  a large  por- 
tion of  the  lower  atmosphere  to  be  sounded. 

Because  of  the  virtual  nonexistence  of  a priori  information 
on  the  Martian  thermal  structure,  only  the  radiance  measurements 
were  available  for  retrieval  purposes.  Kernel  functions  for  a 5- 
interval  set  of  measurements  are  shown  in  Fig.  9.  An  analysis  was 
carried  out  with  this  set  of  functions  in  terms  of  a tradeoff 
between  spread  and  error.  The  resulting  vertical  resolution  as 
measured  by  the  resolving  length  is  shown  in  Fig.  10  as  a function 
of  height  for  a formal  rms  temperature  profile  error  of  2 K.  The 
center  as  a function  of  height  is  shown  in  Fig.  11.  For  purposes 
of  comparison  with  Figs.  7 and  8,  the  Martian  pressure  scale  height 
is  approximately  10  km.  Not  surprisingly,  the  results  for  Earth 
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Fig.  9.  Set  of  15-ym  CO^  band  kernel  functions  used  for 
temperature  sounding  in  the  Martian  atmosphere.  (1  har  = 100  kPa.) 
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Fig.  10.  Resolving  length  as  function  of  height  in  the 
Martian  atmosphere  obtained  with  the  kernel  functions  shown  in 
Fig.  9.  (1  har  = 100  kPa.) 

and  Mars  are  quite  similar.  The  behavior  of  c{z)  indicates  that 
little  useful  information  is  obtained  above  0.1  mb,  and  the  reso- 
lution varies  from  slightly  greater  than  0.5  scale  height  in  the 
lower  atmosphere  to  in  excess  of  2 scale  heights  at  the  upper  limit 
of  the  sounding  region.  Unlike  the  Earth,  however,  the  vertical 
resolution  for  Mars  cannot  be  significantly  improved  at  the  present 
time  because  of  a lack  of  additional  information.  Nevertheless, 
almost  20  thousand  Martian  temperature  profiles  have  been  retrieved 
(Refs.  15  and  16)  and  used  to  study  the  dynamic  regime  of  the  lower 
atmosphere  (Refs.  17,  18,  and  19). 
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Fig- . 11,  Center  as  function  of  height  In  the  Martian  atmo- 
sphere obtained  with  the  kernel  functions  shown  In  Fig,  9. 

(1  bar  = 100  kPa,) 

C-  The  Jovian  Planets 

The  atmospheres  of  the  Jovian  planets  (Jupiter,  Saturn, 

Uranus,  and  Neptune)  are  composed  primarily  of  hydrogen  and  helium 
with  small  admixtures  of  methane,  ammonia,  and  other  gases.  The 
hydrogen  spectrum  includes  pressure  induced  lines,  two  of  which 
are  centered  at  -360  cm  ^ and  600  cm  \ These  lines  are  quite 
broad  100  cm  and  can  be  used  for  temperature  sounding  even 
with  measurements  of  moderate  spectral  resolution.  In  addition, 
the  CH^  band  can  be  used  for  temperature  retrieval,  provided  that 
measurements  wi-th  a spectral  resolution  of  the  order  of  a few  cm 
are  available.  Kernel  functions  for  Jupiter  for  a set  of  spectral 
intervals  including  both  hydrogen  and  methane  absorption  features 
are  shown  in  Fig.  12.  The  uppermost  kernel  requires  a spectral 
resolution  of  1 cm  ^ , while  the  remaining  kernels  are  for  5 cm  ^ 
resolution - 
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Fig.  12.  Kernel  functions  for  temperature  profile  retrieval 
in  the  atmosphere  of  Jupiter.  Spectral  intervals  within  the 
methane  hand  and  pressure  induced  hydrogen  lines  are  included. 

(1  bar  = 100  kPa.) 
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Earth-based  measurements'  of  the  Jovian  thermal  emission  spec- 
trum have  been  used  to  obtain  disk- averaged  temperature  profiles 
(see,  e.g. , Ref.  20) . In  addition,  the  Mariner  Jupiter/Saturn  (MJS) 
1977  Mission  will  carry  a Michelson  interferometer  capable  of 
giving  high  quality  spectral  measurements  at  good  spatial  reso- 
lution during  flybys  of  Jupiter,  Saturn,  and  possibly  Uranus. 
Therefore,  there  is  considerable  interest  in  analyzing  the  capa- 
bilities of  measurements  of  this  type.  The  set  of  kernels  shown 
in  Fig.  12,  excluding  the  uppermost  kernel,  were  employed  in  a 
Backus-Gilbert  analysis.  The  results  are  simmarized  in  Fig.  13 


SPREAD  SCALE  HEIGHT 


Fig.  13.  Spread  as  a function  of  height  in  the  Jovian  atmo- 
sphere obtained  with  the  kernel  functions  shown  in  Fig.  12.  Both 
the  minimum  spread  obtainable  and  the  spread  obtainable  for  the 
temperature  profile  errors  indicated  are  shown.  Measurement  errors 
consistent  with  current  state-of-the-art  instrumentation  were 
assumed.  (1  bar  = 100  kPa.) 
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where  spread  as  a function  of  height  is  plotted.  The  hydrogen  and 
methane  regions  were  treated  separately,  and  two  curves  are  shown 
for  each;  one  represents  the  minimum  spread  achievable  and  the 
second  is  the  spread  obtainable  with  the  profile  errors  indicated. 

An  instriament  noise  level  equal  to  that  anticipated  for  the  MJS  1977 
interferometer  was  assumed.  The  rapid  increase  of  the  spread  as 
either  end  of  a soimding  region  is  approached  is  due  to  the  fact 
that  c(z)  does  not  increase  outside  the  region  from  which  useful 
information  is  obtained  (see  Eq.  (22)).  Thus,  the  total  vertical 
range  covered  by  the  combined  spectral  regions  is  slightly  over  two 
pressure  decades,  although  the  lower  limit  may  be  set  in  practice 
by  the  presence  of  cloud  decks.  As  a final  example,  an  analysis  for 
Uranus  using  only  the  hydrogen  absorption  features  is  presented  in 
Fig.  14.  The  results  are  similar  to  those  for  Jupiter  except  that 
the  pressure  range  over  which  information  can  be  obtained  is  some- 
what displaced. 
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Fig.  14.  Spread  as  a function  of  height  in  the  atmosphere  of 
Uranus  obtained  with  the  kernel  functions  shown  in  Fig.  12, 

(1  bar  = 100  kPa.) 
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Recently,  Gautier  and  Revah  (Ref.  10)  have  provided  an 
analysis  of  the  Jovian  temperature  sounding  problem  from  a more 
conventional  information  theory  point  of  view,  and  it  is  of  inter- 
est to  compare  the  results  of  that  study  with  those  obtained  with 
Backus-Gilbert  theory.  In  the  study  of  Gautier  and  Revah,  it  was 
assumed  that  the  radiative  transfer  Eq.  (24)  could  be  written  in 
a linearized  form 


G(V)  = j B[v^,  T(p)]  e -g  dp  (29) 

where  G(v)  is  a function  of  the  measured  radiance  at  frequency  v, 
and  is  a reference  frequency.  The  Jovian  atmosphere  was  assumed 
to  be  infinitely  deep  so  there  is  no  boundary  term.  For  measure- 
ments within  the  hydrogen  lines,  the  optical  depth  has  the  func- 
tional form 

t(v,  p)  = k(v)p^  (30) 

By  introducing  the  new  variables, 

n = ^^^n  t(v  , p)  (31) 

o 


5 = -In  k(v)/k(v  ) 
o 


(32) 


Eq. 


(29)  can  be  rewritten  in  the  convolution  form 

r 


g(?) 


K(g  - n)  f (n)<3n 


(33) 


where  g(5)  is  obtained  from  the  measurements,  f(n)  is  related  to 
the  temperature  profile,  and  the  kernel  has  the  functional  form 


K(C  - n)  = exp[-  (?  - n)  - exp(5  - n)] 


(34) 


By  eui  appropriate  continuation  of  g(C)  outside  the  range  of  5 for 
which  measurements  are  possible,  the  convolution  theorem  can  be 
applied  to  Eq.  (33)  to  obtain 

g*(k)  = K*(k)  f*  (k)  (35) 

where  the  asterisks  denote  a Fourier  transformation  and  k is 
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interpre-Ced  as  a spatial  frequency  in  the  vertical.  It  is  assumed 
that  the  measurements  are  contaminated  by  noise  whose  Fourier  spec- 
trum N*(k)  is  essentially  constant  over  the  range  of  interest.  In 
general,  jg*(k)|  will  tend  to  be  a monotonically  decreasing  func- 
tion of  k.  At  some  value  of  k,  such  as  k , lg*(k) I will  become 

m 

equal  to  the  noise  |N*(k)j.  Therefore,  for  higher  spatial  fre- 
quencies, no  useful  information  can  be  obtained.  The  minimum  sam- 
pling interval  5n  required  to  reproduce  all  the  profile  information 
contained  in  the  measurements  is  obtained  from  Shannon's  sampling 
theorem 

6n  = ^ (36) 

m 

Gautier  and  Revah  define  the  vertical  resolution  to  be  6n,  which 
is  one-half  of  the  shortest  wavelength  retrievable.  By  assuming 
hydrostatic  equilibrium  and  using  Eqs.  (30)  and  (31),  Eq.  (36) 
can  be  rewritten 


6 z 


(37) 


where,  as  before,  z is  height  in  units  of  scale  height.  Thus,  if 

k can  be  established,  the  vertical  resolution  dan  be  estimated, 
m 

To  calculate  k , g* (k)  must  be  known.  Gautier  and  Revah  estimated 
m 

this  quantity  by  using  a model  Jovian  atmosphere  to  which  white 
noise  of  approximately  2 K standard  deviation  was  added  to  simulate 
fine  scale  structure.  This  permitted  6 z to  be  calculated  as  a 
function  of  measurement  signal  to  noise  ratio. 

It  is  obvious  that  the  Gautier-Revah  and  Backus -Gilbert 
approaches  are  philosophically  somewhat  different,  as  the  former 
makes  use  of  the  anticipated  properties  of  the  profile  to  be 
retrieved  while  the  latter  depends  solely  on  the  behavior  of  the 
kernel  functions.  Nevertheless,  it  is  of  interest  to  compare  the 
results  obtained  with  the  two  methods.  For  signal  to  noise  ratios 
comparable  to  those  anticipated  in  the  hydrogen  lines  with  the 
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MJS  interferometer,  the  calculations  of  Gautier  and  Revah  indicate 
a vertical  resolution  of  approximately  0.5  scale,  height  which  is 
consistent  with  the  Backus-Gilbert  results  shown  in  Fig.  14. 

V.  CONCLUDING  REMARKS 

The  Backus-Gilbert  theory  has  proven  to  be  a useful  tool  for 
analyzing  the  potential  of  various  types  of  remote  radiation 
measurements  for  the  retrieval  of  atmospheric  profiles.  While  it 
has  generally  come  to  be  regarded  as  a means  of  studying  the  trade- 
off between  vertical  resolution  and  profile  error,  its  applicability 
is,  in  fact,  considerably  broader.  It  can  be  used  to  construct 
averaging  kernels  whose  shapes  are  dictated  by  the  requirements  of 
a particular  application.  The  extent  to  which  the  method  is  suc- 
cessful can  be  judged  on  the  basis  of  whether  the  resulting 
averaging  kernels  are  better  approximations  to  what  is  required 
than  are  the  kernel  functions  of  the  original  set  of  integral 
equations. 

Examples  from  several  widely  differing  applications  were  pre- 
sented in  this  review-  However,  the  results  were  quite  similar 
in  all  cases.  The  broad,  smooth  kernels  associated  with  radiative 
transfer  processes  whether  it  is  backscattered  solar  radiation  or 
thermally  emitted  radiation  cannot  be  combined  into  sharply  peaked 
averaging  kernels  without  incurring  strong  propagation  of  measure- 
ment errors.  However,  examination  of  each  case  reveals  that  for 
reasonable  error  levels,  averaging  kernels  can  be  obtained  which 
are  narrower  than  the  original  radiative  transfer  kernels.  In  this 
sense,  inversion  of  the  data  sets  is  judged  to  be  worthwhile.  In 
the  case  of  temperature  profiles  in  the  Earth’s  atmosphere,  a large 
body  of  information  is  available  in  addition  to  radiance  measure- 
ments and  this  information  can  be  used  to  improve  the  quality  of 
the  retrievals.  For  other  planetary  atmospheres,  frequently  little 
other  info2rmation  is  available-  In  such  cases,  analyses  of  the 
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type  presented  are  particularly  useful  in  establishing  the  value 
and  limitations  of  retrieved  profiles. 


The  Backus-Gilbert  theory  is,  of  course,  not  unique  in  its 
ability  to  analyze  vertical  resolution.  Another  approach,  which 
is  of  interest  because  of  its  use  of  more  conventional  information 
theory  techniques,  was  briefly  reviewed  and  found  to  give  essen- 
tially the  same  vertical  resolution,  at  least  for  the  one  example 
considered. 
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SYMBOLS 

column  vector  of  coefficients  (Eq,  (6)) 
coefficients  used  in  linear  estimation 
averaging  kernel 
half-  width  at  half--maximum 

Planck  radiance  at  temperature  T and  ith  frequency 
center  of  averaging  kernel  defined  in  Eq,  (20) 
desired  function  in  definition  of  Q(z) 
measurement  error  covariance  matrix  (Eq.  (6)) 
matrix  elements 
unknown  profile 

value  of  f corresponding  to  jth  layer 
inferred  property  of  unknown  profile 
solar  flux  at  ith  frequency  or  wavelength 
vector  of  measurement  quantites  (Eq.  (8)) 
measured  quantities 

function  of  measured  radiance  at  frequency 
radiance  at  ith  frequency  or  wavelength 
weight  factor  in  definition  of  Q(z) 
absorption  coefficient 
kernel  functions 
kernel  elements 

resolving  length  of  kernel  defined  in  Eq.  (21) 
atmospheric  pressure  level 
lower  boundary  pressure  level 
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Or 


quadratic  form  used  to  control  shape  of  averaging  ker- 
nel 

factor  used  in  R(z) 

2 

linear  function  of  Q(z)  and  a^(z) 
matrix  defined  in  Eq.  (8) 

spread  of  averaging  kernel  defined  in  Eq.  (14) 
matrix  elements  defined  in  Eq.  (10) 

optical  depth  from  level  p to  top  of  the  atmosphere  at 
frequency 

atmospheric  temperature  as  a function  of  pressure 

reference  temperature  profile 

vector  defined  in  Eq.  (16) 

vector  elements 

vector  defined  in  Eq.  (8) 

vector  elements 

weight  used  in  R(z) 

matrix  defined  in  Eq.  (17) 

matrix  elements 

amount  of  ozone  in  jth  layer 

amount  of  ozone  in  an  atmospheric  column  above  pres- 
sure level  p 
first  guess  for  X(p) 
height-related  independent  variable 
scattering  extinction  coefficient 
height-related  variable  defined  in  Eq.  (31) 
frequency 

frequency  related  variable  defined  in  Eq.  (32) 
error  variance  in  f(z) 
measurement  error  variance 
error  variance  in  T 

atmospheric  transmittance  from  level  z to  the  top  of 
the  atmosphere  at  ith  frequency 
matrix  defined  following  Eq.  (23) 


185 


Illllllllllllllllllllllllllllll 


matrix  elements 
delta  fianction 
= (1  - w)r/w 
solar  zenith  angle 
reference  frequency 
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DISCUSSIONS 

Chahine:  Do  you  really  need  to  use  the  Gilbert-Backus  method  to 

find  that  the  optimum  spacing  is  reduced  to  half  width? 

Conrath:  It  depends  on  what  you  are  trying  to  do.  It  is  not 

always  true.  The  optimum  resolution  turns  out  to  be  of  the  order 
of  the  half-width  of  the  original  kernel-  However,  you  can  improve 
on  it  a bit  by  moving  around  on  the  trade-off  curve,  particularly 
in  the  vicinity  of  the  elbow.  It  depends  on  how  you  wish  to  trade 
off  between  resolution  noise. 

Chahine:  The  degree  of  resolution  is  not  only  a function  of  the 

kernel  and  the  data,  but  it  is  also  a function  of  the  structure  of 
the  solution  profile.  To  make  my  point  clear,  I need  to  draw  a 
very  brief  diagram  on  the  blackboard. 

Conrath:  I think  the  point  is  that  you  have  to  be  very  careful  by 

what  you  mean  by  "resolution,"  Go  ahead. 

Chahine:  If  you  are  trying  to  "resolve"  the  tropopause  from  remote 

sensing  data  you  will  find  that  this  is  an  extremely  difficult  task 
no  matter  how  narrow  the  weighting  functions  are.  This  is  a result 
of  the  fact  that  the  outgoing  radiance  is  a weak  function  of  the 
value  of  temperature  at  the  tropopause.  On  the  other  hand,  one  can 
resolve  the  stratopause  with  the  same  set  of  weighting  function. 
Thus,  my  question  is:  How  can  you  get  the  resolving  power  without 

taking  the  structure  of  the  solution  into  account? 


Fig.  D-1, 

Conrath:  Again,  I think  it  depends  on  how  you  define  the  resolving 

power  of  the  resolution  essentially.  In  the  case  of  Backus-Gilbert, 
if  you  use  the  spread  of  the  resolving  length  as  the  parameter,  it 
is  in  the  same  sense  that  you  would  use  the  width  of  a spectrometer 
slit  function  as  a measure  of  your  resolution.  Obviously,  what 
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sort  of  features  you  can  see  depend  completely  on  the  details  of 
the  spectrum.  You  can  see  or  detect  a spectral  line  much  nar- 
rower than  your  slit  function  provided  it's  strong  enough.  You’ll 
see  it  spread  out,  of  course. 

Chahine:  Then  would  I change  my  resolving  power  if  my  solution 

happened  to  be  like  in  Fig.  D-l(a)  instead  of  being  like  in  Fig. 
D-l(b) ? 

Conrath:  Well,  not  in  the  definition  I am  using.  So  you  are  free 

to  define  it  however  you  prefer, 

Chahine : Okay . 

Conrath:  So  you  have  to  take  it  in  the  context  in  which  it  is  used. 

Chahine:  So  the  resolving  power  that  I wanted  to  ask  is,  is  the 

function  really  of  the  solution  you  are  after? 

Conrath:  If  you  want  to  define  it  your  way,  yes, 

Kaplan:  But  he  is  not  defining  it  .that  way. 

Conrath:  I think  it  is  arbitrary. 

Rodgers:  The  situation  that  Dr.  Chahine  has  drawn  on  the  board, 

the  resolving  power  is  identical  in  both  cases.  There  is  no  dis- 
tinction between  those  two  profiles.  It  is  just  a matter  of  sign, 
providing  it  is  a reasonably  linear  problem  and  the  15  micron  band 
is  reasonably  linear  in  that  case.  There  is  quite  a distinction 
between  information  and  signal  and  it  must  be  realized.  The  eye 
may  see  a low-wattage  bulb  because  the  eye  is  a logarithmic  device - 
The  detectors  we  use  are  linear  devices. 

Chahine:  If  the  solution  happened  to  be  the  stratopause,  I can 

resolve  it;  if  it  is  the  tropopause,  I cannot.  What  is  the 
resolving  power  then? 

Rodgers:  The  resolving  power  is  something  that  doesn’t  have  any 

kind  of  meaning  in  your  relaxation  method. 

Chahine:  No,  I am  not  thinking  in  terms  of  the  relaxation  method. 

Rodgers:  In  Backus-Gilbert  method,  those  two  situations  are  iden- 

tical. The  sign  is  irrelevant.  Absolute  value  is  irrelevant. 

Drayson:  I think  the  point  of  nonlinearity  here  is  very  important. 

If  you  work  in  the  4.3  micron  band  the  temperatures,  the  Planck 
function  is  a very  nonlinear  function  of  temperature.  But  if  one 
is  holding  for  the  Planck  function  and  gets  a temperature  from  that. 
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then  you  do  have  a linear  problem.  So,  I think  the  point  Clive  is 
making  about  the  in str lament  senses  in  a linear  sort  of  way  is  a 
very  good  point  here.  But  if  you  are  trying  to  get  temperature 
directly,  what  Chahine  says  about  the — if  you  are  thinking  on  a 
4.3  micron  band,  then  you  can't  see  that.  You  very  much  enhance 
the  positive  temperature  there  because  the  signal  is  much  much 
stronger,  because  of  a nonlinearity  of  the  Planck  function. 

Kaplan:  Of  course,  nobody  is  interested  in  the  Planck  function 

unless  it's  a game  to  play.  It's  really  the  temperature  that  we 
are  after,  and  so  the  nonlinearity  does  come  in  and  it  is  crucial. 
I'll  talk  more  about  this  later. 

Westwater:  I really  was  going  to  amplify  Dr.  Rodgers'  comments. 

But  what  the  Backus-Gilbert  technique  really  does  is  estimate  one 
functional  from  another  and  the  functional  that  you  estimate  is 
the  convolution  of  the  averaging  kernel  with  the  particular  profile 
that  you  are  trying  to  retrieve.  And,  of  course,  that  functional 
will  vary  depending  on  the  radiance  that  your  observations  have 
sensed.  But  the  measure  of  resolution  itself,  mainly  the  spread 
of  the  averaging  kernel,  does  not  depend  on  the  profile  you  are 
trying  to  retrieve.  And  I think  this  is  what  Clive  Rodgers  also 
has  said. 

Conrath:  Yes,  again  in  the  sense  of  the  analogy  with  the  instru- 

ment slit  function  of  a spectrometer  in  sensing  a spectrum;  what 
you  actually  see  depends  on  the  spe.ctriom- 

Twomey:  I simply  wanted  to  comment  that  the  possibilities  of 

getting  a trade-off  curve  and  the  scanning  function  or  slit  function 
or  whatever,  this  can  be  applied  with  any  linear  method  at  all. 

But  in  the  Gilbert-Backus  procedure  it  is  looked  at  explicitly. 

The  nonlinear  method,  you  produce  a combination  of  your  measurements, 
your  g's,  whatever  they  may  be  called.  And  so  provided  it  is  a 
linear  method,  your  solution  is  a convolution  type  operation  or  is 
gotten  by  a convolution  type  operation  on  your  inaccessible  f(x). 

And  you  can  always  calculate  this  function  and  you  can,  of  course, 
look  at  the  spread  of  it.  It  is  not  a specific  property  of  the 
Gilbert-Backus  procedure. 

Conrath:  That's  right.  In  fact,  you  can  derive  various  other 

algorithms  from  the  Backus-Gilbert  point  of  views  simply  by  mini- 
mizing a different  quadratic  form. 

Barkstrom:  I would  like  to  make  a comment  and  see  if  the  procedure 

that  I am  suggesting  is  correct  on  the  Backus-Gilbert,  that  in  the 
procedure  if  you  go  back  to  the  original  papers,  for  example, 

Backus'  paper  and  Backus-Gilbert  and  Parkers  exposition,  there's 
a suggestion  in  there  that  you  would  go  ahead  and  perfoonn  an 
inversion  and  then  the  trade-off  curve  is  computed  quite  separately. 
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It  would  suggest  an  inversion  procedure  where,  in  fact,  you  per- 
formed the  original  inversion  procedure  and  then  smoothed  after 
you  have  done  the  retrieval.  Is  that  a . . .? 

Conrath:  That  is  not  my  Impression  of  what  Backus  and  Gilbert 

originally  had  in  mind.  I suppose  you  could  approach  it  from  that 
point  of  view.  If  you  use  it  as  an  inversion  method,  the  retrieval 
you  get  depends  on  where  you  are  on  the  trade-off  curve.  You  have 
to  choose  your  value  of  what  I call  ”w**  here, 

Barkstrom:  How  constant  is  the  value  of  ”w”? 

Conrath:  What  do  you  mean  "how  constant"? 

Barkstrom:  If  you  choose  a given  tolerance  on  your  retrieved  pro- 

file, how  constant  is  the  value  of  "w"  across  the  atmosphere? 

Conrath:  You  mean  in  going  from  one  sounding  to  another? 

Barkstrom:  Going  from  one  region  of  the  atmosphere  to  another. 

Conra th : Hor izon tal ly ? 

Barkstrom : Presumably  vertically. 

Conrath:  Vertically,  you  can  choose  "w"  different  at  different 

levels.  In  the  type  of  Backus-Gilbert  method  I have  outlined  here, 
you  essentially  do  the  analysis  at  each  individual  level  inde- 
pendent of  all  other  levels. 

Barkstrom:  Yes,  and  then  if  you  set  the  tolerance  on  the  retrieve 

profile  at  some  particular  value  , . . 

Conrath:  No,  "w"  would  change  with  height  in  general, 

Fleming:  Just  to  go  back  to  Roland  Drayson*s  comments,  I think  it 
is  a matter  of  how  you  treat  your  kernel  function.  If  you  linearize 
the  Planck  function,  you  get  a factor  derivative  of  the  Planck 
function  with  respect  to  temperature.  In  that  case,  if  you  simply 
redefine  your  kernel  functions  to  include  the  original  waiting 
function  times  that  factor  and  call  that  new  quantity  the  kernel 
function,  then,  indeed,  the  kernel  is  temperature  dependent  and 
then  the  resolving  power  will  be  a function  of  temperature.  But, 
if  you  use  the  method  of  Dr,  Chahine,  he  specifically  objects  to 
linearizing  the  Planck  function,  in  which  case,  he  must  live  with 
the  original  weighting  function  which  is  temperature  independent, 
except  for  there  is  a dependence  but  in  another  sense.  In  which 
case,  I would  have  to  agree  with  Dr.  Conrath, 
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Conrath:  I think  what  you  * re  talking  about  essentially  is  a kernel 

of  the  form  (dx/dx  x dB/dT) . 

Fleming:  Yes.  In  that  case,  it  becomes  temperature  dependent. 

If  you  actually  define  your  kernel  functions  that  way,  then  you 
have  included  a temperature  dependence  and  I would,  of  course,  have 
to  agree  with  Dr.  Chahine. 

Susskind:  I'm  a little  confused  about  what  you  mean  by  "resolution,” 
I just  want  to  ask  a very  simple  question.  Say  we  have  a range  of 
atmosphere  10  scale  heights  and  you  have  10  measurements.  Is  it 
possible  to  have  a resolution  of  better  than  one  scale  height?  Can 
you  by  this  Backus-Gilbert  theory  show  the  optimum  is  better  than 
one  scale  or  you'd  be  showing  my  10  measurements  aren't  really  all 
that  independent  and  maybe  I can  only  get  two  scale  heights?  I 
mean,  is  it  possible  to  have  more  resolution  than  you  have  measure- 
ments in  the  range  in  which  you  are  trying  to  measure  things? 

Conrath:  I think  the  answer  is  no. 

Susskind : I should  think  so  too,  but  I wasn't  quite  clear.  So  in 

other  words  you  could  only  be  degrading  what  you  think  you  have? 

Conra th : That  * s r ight . 

Susskind:  Not  doing  any  better  than  what  it  appears  to  be.  All 

right,  so  you're  talking  about  the  optimum  niimber  of  independent 
things  you  can  think  you  measure , and  not  do  any  better , Good . 


193 


INVERSION  OF  liJFRARED  LIMB  EMISSION 
MEASUREMENTS  FOR  TEMPERATURE  AND 
TRACE  GAS  CONCENTRATIONS 


John  C.  Gille  and  Paul  L.  Bailey 
National  Center  for  Atmospheric  Research 


Limb  emission  measurements  are  characterized  by  sharp 
weighting  functions  at  high  altitudes / and  for  temperature 
determinations,  strongly  nonlinear  dependence  of  the 
weighting  function  on  the  temperature.  Several  methods  for 
inverting  this  type  of  measurement  have  been  described  and 
used,  including  iterative,  statistical , nonlinear  and 
approximate  direct  approaches.  These  approaches  will  be 
described;  advantages  and  disadvantages  of  each  will  be 
outlined. 


I . INTRODUCTION 

In  techniques  utilizing  limb  emission,  often  called  the  limb 
scanning  approach,  the  data  for  inversion  are  measurements  of  the 
radiance  emitted  by  the  atmosphere,  made  while  a passive  radiometer 
scans  from  the  planet  to  space  across  the  planetary  limb.  The 
change  in  geometry  changes  the  inversion  problem  from  that  for 
downward  or  nadir  viewing  instruments,  most  importantly  by  making 
the  problem  of  temperature  determination  strongly  nonlinear,  but 
more  agreeably  by  yielding  narrower  weighting  functions. 

The  geometry  of  limb  radiance  measurements  is  shown  in  Fig.  1. 
The  advantages  of  the  technique,  which  follow  from  the  geometry, 
have  been  described  by  Gille  and  House  (Ref.  1)  [herein  referred 
to  as  GH]  . They  need  be  siammarized  only  briefly  here.  First, 


195 


Fig.  1.  The  geometry. of  limb  scanning. 

because  of  the  geometry,  there  is  no  contribution  from  atmospheric 
layers  below  the  lowest  point  along  the  ray  path*  Because  of  the 
rapid  drop  of  atmospheric  density  above  the  lowest  point,  called 
the  tangent  point,  very  narrow  weighting  functions  result. 

Secondly,  the  long  slant  patn  leads  to  more  emitter  along  a path 
through  a given  altitude,  and,  thus,  sensitivity  to  higher  alti- 
tudes. A third  advantage  is  that  the  cold  background  of  space 
means  that  all  the  signals  originate  in  the  atmosphere,  and  no 
variability  can  be  attributed  to  the  background. 

Because  the  vertical  resolution  is  coming  from  the  geometric 
effects  in  a real  instrtiment,  the  vertical  field  of  view  must  be 
as  narrow  as  practicable  to  obtain  the  advantages  of  the  inherent 
high  vertical  resolution.  However,  the  spectral  width  can  be  made 
very  broad  in  order  to  get  more  signal.  This  illustrates  a general 
tendency  for  geometric  and  spatial  effects  to  play  the  role  in 
limb  scanning  that  spectral  effects  do  in  nadir  sounding.  In 
operation,  the  radiometer  samples  the  atmospheric  signal  many  times 
during  the  scan  across  the  limb. 
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The  equation  from  which  the  outgoing  limb  radiation  is  cal- 
culated is  given  as 
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where  N(h)  is  the  radiance  received  when  looking  at  tangent  point 
h,  B is  the  Planck  function,  x is  the  coordinate  along  the  ray  path, 
and  x(h,  x)  is  the  transmittance  between  the  spacecraft  and  the 
point  X along  a path  through  h.  By  converting  from  a horizontal 
to  a vertical  integral,  and  rearranging  terms,  one  obtains 
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where  z is  the  vertical  coordinate,  c is  the  concentration  of  the 
absorber  gas,  p the  atmospheric  density,  a the  amount  of  absorber 
along  the  ray  path  between  the  atmospheric  level  and  the  satellite, 
and  subscripts  a and  p refer  to  points  on  the  ray  path  anterior  and 
posterior  to  the  tangent  point-  Equation  (2)  can  be  written 
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where  W is  the  weighting  function  which  tells  how  much  the  level  z 
contributes  to  radiation  observed  along  a path  whose  lowest  point 
is  h. 


Weighting  functions  for  an  ideal  instrument  with  an  infin- 
itesimal field  of  view  have  been  presented  for  a broad  carbon 
dioxide  channel  by  GH.  They  typically  have  widths  at  the  half- 
power points  in  the  order  of  3 km.  These  are  for  an  ideal  instru- 
ment with  an  infinitesimal  field  of  view.  For  a real  instrument, 
with  a nominal  2 km  field  of  view,  the  weighting  functions  are  as 
shown  in  Fig.  2.  Note  that  as  a result  of  convolving  the  infini^ 
tesimal  weighting  functions  with  a finite  field  of  view,  the 
weighting  function  is  now  about  S'  km  wide* 
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Fig,  2.  Weighting  functions  for  idmb  scanning,  with  a finite 
field  of  view.  The  field  of  view  has  a Gaussian  shape,  and 
nominal  2 km  width. 

The  important  thing  to  note  is  that  the  weighting  function 
depends  strongly  on  the  density  at  the  tangent  point.  This  is  the 
reason  for  the  major  nonlinearity;  through  the  hydrostatic  equation, 
temperature  at  levels  below  a geometric  level  will  affect  the 
pressure  and  density  at  the  level,  and,  therefore,  change  the 
weighting  function.  This  is  what  makes  the  temperature  determi- 
nation from  limb  radiance  measurements  the  strongly  nonlinear 
problem  that  it  is- 

Weighting  functions  for  trace  constituents,  such  as  ozone, 
have  been  calculated  by  Gille  and  others  (Ref.  2)  and  by  Russell 
and  Drayson  (Ref.  3).  They  also  display  the  narrow  width  expected 
in  the  limb  geometry.  Once  the  temperature  profile  is  determined, 
the  trace  gas  concentration  measurements  are  a straightforward, 
nearly  linear  problem  as  in  the  nadir  viewing  case. 
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First  to  be  presented  in  this  paper  will  be  results  and 
thoughts  on  the  information  content  of  limb  radiance  measurements. 
This  will  be  followed  by  a description  of  four  inversion  techniques 
currently  being  actively  pursued. 

At  this  time,  an  operational  method  for  inverting  the  data 
obtained  from  the  Nimbus  6 Limb  Radiance  Inversion  Radiometer 
(LRIR)  is  being  developed.  In  a few  months,  it  is  expected  that 
much  more  definitive  comparisons  between  the  different  techniques 
and  quantitative  results  confirming  the  technique,  and  establishing 
the  accuracy  of  the  results,  will  be  available.  In  the  concluding 
section,  some  preliminary  results  of  the  recovery  of  temperature 
and  ozone  profiles  from  the  data  are  shown. 


II.  INFORMATION  CONTENT  OF  LIMB  RADIANCE  MEASUREMENTS 

It  is  desirable  to  be  able  to  assess  the  amount  of  information 
in  a set  of  measurements,  in  order  to  guide  algorithm  development, 
and  to  ensure  that  there  is  a reasonable  match  between  the  effort 
that  goes  into  the  inversion  and  the  information  contained  in  the 
radiance  data.  A technique  to  determine  the  information  content 
has  been  described  by  Gille  and  Bailey  (Ref.  4).  It  will  be 
siommarized  very  briefly  here. 

In  that  reference,  it  was  pointed  out  that  three  effects  have 
the  potential  to  degrade  the  information  that  is  contained  in  a 
set  of  radiance  measurements.  First,  radiative  transfer  itself 
tends  to  obliterate  some  detail.  Second,  the  characteristics  of 
the  measuring  instrument  will  add  further  smoothing  and  reduction 
of  information.  Finally,  the  retrieval  may  also  reduce  the  infor- 
mation that  still  remains-  In  the  reference  cited,  the  first  two 
effects  were  studied. 

The  method  was  to  calculate  the  change  in  the  outgoing  radi- 
ance due  to  sinusoidal  temperature  perturbations  with  2 K amplitude 
and  vertical  wavelengths  from  2 to  14  km.  It  became  clear  that  the 
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features  with  shorter  wavelengths  are  greatly  attenuated  compared 
with  larger  disturbances  just  by  the  radiative  transfer  process. 
This  is  what  one  would  expect  when  considering  the  width  of  the 
weighting  functions,  even  for  an  infinitesimal  field  of  view. 

The  instrument  field  of  view  response  may  be  approximated  by 
a Gaussian,  for  which  one  can  calculate  analytically  a smoothing  of 
the  signal.  One  finds  that  small  vertical  scales  are  further 
reduced.  By  comparing  these  with  the  effects  of  instrumental  noise 
and  the  effects  of  imprecise  sample  spacing,  one  cam  determine  what 
vertical  wavelengths  are  just  at  the  noise  level.  (This  can  also 
be  used  to  determine  how  frequently  the  radiance  profile  needs  to 
be  sampled  for  given  instrument  characteristics.)  Finally,  because 
there  is  a trade-off  between  instrument  field  of  view  and  noise,  it 
is  possible  to  determine  the  optimal  field  of  view  for  a given 
application. 

Another  way  of  looking  at  this  is  to  perform  a spectral  decom- 
position of  limb  scanning  measurements.  An  example  of  such  a spec- 
trxxm  is  shown  in  Fig.  3,  in  which  the  amplitude  of  the  components 
are  plotted  against  spatial  frequency.  For  the  Nimbus  6 geometry, 
one  cycle  per  milliradian  is  a wave  of  4 km  vertical  extent;  0.5 
cycle  per  milliradian  is,  therefore,  a wave  with  8 km  wavelength. 

From  the  previous  discussion,  it  will  be  recognized  that  the 
amplitude  of  the  signal  at  high  frequencies  contains  only  noise, 
and  that  there  is  no  information  about  the  atmosphere  contained  at 
these  frequencies.  One  may  then  apply  an  optimal  filter  (Ref.  5) 
to  determine  what  the  real  amplitude  might  be.  In  Fig.  3,  the 
circles  represent  the  unfiltered  signal,  whereas  the  symbol  F indi- 
cates the  result  of  applying  an  optimal  filter.  If  one  knows  some- 
thing about  the  modulation  transfer  function  of  the  instrument, 
then  one  can  do  a certain  amount  of  boosting  of  the  middle  fre- 
quencies (to  compensate  for  the  smoothing  introduced  by  the  instru- 
ment) . It  is  even  possible  that  the  atmospheric  smoothing  can  be 
partially  compensated.  Clearly,  however,  one  must  have  a high 
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Fig.  3,  Amplitude  spectrum  of  the  narrow  CO^  channel  limb 
radiance  profile.  Circles  indicate  the  original  spectrum;  F's 
show  the  spectrum  after  optimal  filter  is  applied . 

signal  to  noise  ratio  in  order  to  be  reasonably  confident  that  one 
is  actually  increasing  the  realism  of  the  reconstructed  radiance 
profile. 


III.  ITERATIVE  INVERSION  TECHNIQUE 


The  equation  for  the  limb  radiance  profile  can  be  transformed 
to  (GH) 
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where  the  pressure  is  at  the  reference  level  z^.  The  exponential 

terms  show  the  effect  of  the  hydrostatic  equation  on  the  pressure, 
and  the  nonlinear  dependence  of  radiance  on  the  temperature  pro- 
file, Because  of  this,  and  the  sharply  peaked  weighting  function, 

GH  utilized  the  iterative  inversion,  technicjue  of  the  Chahine  type. 
This  is  very  fully  described  in  GH . and  will  not  be  gone  into  in 
greater  detail  here.  Some  features  of  the  iterative  scheme  are 
presented  in  Ref  6,  Perhaps  most  interesting  is  that  for  the 
implementation  used  by  GH,  although  radiance  residuals  continue  to 
drop  with  continued  iteration,  temperature  errors  went  through  a 
minimum  and  then  began  to  grow.  Another  feature  was  that  in  early 
stages  of  the  iteration  radiance  residuals  oscillated  as  a function 
of  iteration  number,  presumably  as  the  atmosphere  "sloshed"  while 
it  adjusted  itself  to  the  hydrostatic  equation. 

This  method  gives  good  results,  which  are  nearly  independent 
of  the  initial  guess.  The  major  problem  as  far  as  reducing  a 
large  amount  of  data,  however,  is  the  large  number  of  iterations 
required  to  drive  the  residuals  below  the  root  mean  square  (rms) 
radiance  error.  An  iterative  approach  must  be  made  very  fast  in 
order  to  process  large  amounts  of  data. 

Gille  and  others  (Ref.  2)  also  applied  the  iterative  inversion 
to  constituent  profiles.  For  this  problem,  the  results  were  fairly 
fast  in  terns  of  the  number  of  iterations;  only  two  to  four  itera- 
tions were  generally  required  to  get  the  residuals  below  the 
radiometric  error.  However,  the  calculation  of  the  radiance  is 
still  a very  time-consuming  process.  Two  variations  of  this 
approach  might  be  noted;  Gille  and  others  (Ref.  2)  corrected  the 
profile  at  each  level,  based  only  on  the  measurement  at  that  level. 
Tallamraju  (Ref.  7)  used  a corrector  for  a level  based  on  all  the 
radiances. 
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IV.  DIRECT  INVERSION 


In  the  early  studies  of  the  limb  radiance  problem,  it  was 
recognized  that  a simple  first-order  inversion  method  was  possible 
due  to  the  highly  localized  nature  of  the  weighting  function 
(Ref.  8) . This  behavior  provided  the  motivation  for  the  corrector 
equation  in  the  iterative  scheme  used  by  GH.  More  recently,  a 
similar  principle  has  been  suggested  for  use  with  pressure  modu- 
lated limb  scanning  radiometer  proposed  for  Nimbus  G, 

As  described  by  House  and  Ohring  (Ref.  8) , the  radiance  at 
a tangent  height  h may  be  approximated  by 

N(h)  - B(T(h))e(h)  (5) 

where  e (h)  is  the  effective  atmosphere  emittance.  If  the  emittance 
is  known,  the  temperature  may  be  recovered  from  the  Planck  func- 
tion. If  the  constituent  concentration  is  sought,  the  temperature 
must  be  known  as  well  as  a relationship  between  emittance  and 
mixing  ratio . 

For  an  isothermal,  constant  mixing  ratio  atmosphere  with  con- 
stant absorption,  the  emittance  is  given  by  Burn  and  Upplinger 
(Ref.  9) 

e (p  (h)  , W,  T)  = 1 - exp  ( ) (6) 

/t 

where  p (h)  is  the  pressure  at  tangent  height  h,  W is  the  mixing 
ratio,  T is  the  temperature,  and  K is  the  absorption  coefficient. 

The  localization  of  the  radiance  contribution  to  the  vicinity 
of  the  tangent  point  is  such  that  virtually  all  the  radiance 
comes  from  within  two  scale  heights  of  the  tangent  point  for 
moderate  or  weak  absorption.  On  the  order  of  80  to  90%  comes  from 
within  one  scale  height  of  the  tangent  point.  In  a more  realistic 
situation,  where  the  absorption  coefficient  is  directly  proportional 
to  pressure,  the  degree  of  localization  will  be  enhanced  since  the 
largest  pressures  occur  at  the  tangent  point.  The  radiance  at 
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tangent  height  h should  then  be  approximated  by  the  form 


N(p(h))  B(T(h)) 


1 


exp 


(-K(p(h))p(h)W(h) 
/ f (h) 


(7) 


where  the  variables  T(h)  and  W(h)  represent  temperature  and  mixing 
ratio  averaged  in  some  manner  over  the  region  of  the  atmosphere 
from  which  the  radiance  arises.  By  assuming  the  radiance  all  comes 
from  within  one  scale  height  of  the  tangent  point, 


T(h) 


W(h) 


rh  + H 

rh  + H 

T(z)A(z)dz 

A(z) dz 

•*h  J 

h 

j-h  + H 

rh  + H 

W(z)A(z)dz 

A(z)dz  ^ 

Jh  J 

h 

(8) 


where  A(z)  is  an  arbitrary  weight  depending  upon  the  optical 
characteristics  of  the  atmosphere  in  the  vicinity  of  h. 

The  effective  absorption  coefficient  may  be  difficult  to 
accurately  specify  and  could  most  easily  be  obtained  empirically 
from  detailed  forward  radiance  calculations.  The  most  useful  form, 
however,  is  to  have  the  effective  emittance  specified  in  terms  of 
the  atmospheric  variable  of  interest 

£(P,  T,  W)  = (9) 

B(T) 

In  the  case  of  CO^f  the  mixing  ratio  profile  is  assumed  to  be 
globally  uniform  and  does  not  need  to  be  considered  explicitly. 
Temperature  variability  also  appears  to  be  of  second  order  impor- 
tance and  can  be  neglected  as  an  explicit  parameter.  This  results 
in  a simple  relationship 

N(p)  = B(T(p))e(p)  (10) 

where  B(T)  is  the  instrument  response  weighted  Planck  function, 
which  may  be  parameterized  and  inverted  to  give  T(B) . The  e (p)  are 
determined  from  Eq.  (9)  by  calculating  N(p)  and  T (using  Eq.  8) 
for  a set  of  atmospheres.  The  effective  emissivity  for  LRIR 
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Fig.  4.  Effective  emissivity  of  the  narrow  carbon  dioxide 
channel  for  midlatitude  summer  standard  atmosphere/  as  a function 
of  pressure.  (1  bar  =100  kPa.) 

Channel  1 (narrow  CO^)  is  shown  in  Fig.  4 for  the  suiraner  mid- 
latitude standard  atmosphere.  The  effective  emissivities  for  other 
atmospheres  fall  very  close  to  the  same  curve.  The  effective 
emissivity  for  Channel  2 (broad  CO^)  has  a similar  shape,  but  is 
displaced  toward  lower  emissivity  and  higher  pressures. 

It  can  be  seen  that  values  for  the  emissivity  become  greater 
than  one  as  the  channel  becomes  opaque.  In  reality,  the  effective 
temperature  for  these  pressures  should  be  derived  from  a much 
deeper  layer  of  the  atmosphere  which  is  appreciably  warmer  than 
the  region  near  the  tangent  point. 
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For  nonuniformly  mixed  constituents,  Eq.  (9)  is  solved  for 
e(P/  T,  W) , which  is  then  interpreted  to  give  W as  a function  of 
pressure.  Inversion  for  temperature  using  a uniformly  mixed  con- 
stituent like  CO^  can  proceed  in  several  ways.  If  the  radiance 
N(p)  is  known  at  pressure  p,  the  emissivity  and  hence,  B(T(p)) 
and  T(p)  can  be  obtained  directly. 

If  the  radiance  is  known  to  correspond  to  some  weighted  tem- 
perature, the  pressure  corresponding  to  the  resulting  emissivity 
can  be  obtained  from  the  emissivity  curve.  For  constituents  such 
as  ozone,  the  mixing  ratio  can  be  obtained  if  the  radiance  is 
known  to  correspond  to  a specified  temperature  and  pressure.  In 
this  situation,  it  is  convenient  to  fit  emittance  in  the  form  shown 
in  Eq.  (7) 

e(P,  T,  w)  = e(E^) 

In  application,  CO^  radiance  will^  not  be  known  as  a.  function 
of  either  temperature  or  pressure.  Both  parameters  must  be  deter- 
mined from  observations  made  in  two  different  spectral  channels  in 
the  15-ym  CO^  band  having  different  optical  properties. 

If  both  channels  are  looking  at  a level  in  the  atmosphere 
where  one  of  the  channels  is  becoming  opaqae,  the  effective  radia- 
ting temperature  of  the  opaque  channel  is  very  close  to  the  atmo- 
spheric temperature  at  that  level.  By  using  this  temperature,  the 
emissivity  for  the  transparent  channel  and  hence  the  pressure  at 
that  level  may  be  determined.  If  the  angular  separations  between 
samples  on  the  radiance  profiles  are  known,  the  entire  profile  of 
effective  temperature  against  pressure  may  be  reconstructed  by 
applying  the  hydrostatic  equation,  the  spectrally  weighted  Planck 
function,  and  the  curves  of  emissivity  against  pressure.  This  is 
shown  schematically  in  Fig.  5. 

Depending  upon  the  averaging  function  A(z)  used  in  Eq.  (8) , 
the  T(p)  profile  may  be  shifted  an  appropriate  amount  in  P to 
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radiance 


Fig,  5.  Schematic  of  the  direct  inversion  scheme^  Point 
where  narrow  channel  is  saturated  allows  determination  of  the 
temperature f from  which  the  broad  channel  emissivity  £.^(p)  , and, 
therefore , p(^^)  “ determined, 

provide  an  estimate  of  the  T(p)  profile.  Thus,  the  technique  has 
great  potential  for  providing  initial  profiles  for  more  exact 
inversion  schemes. 

Once  the  temperature  and  pressure  profiles  have  been  deter- 
mined, a straightforward  application  of  Eqs.  (9)  and  (12)  can  be 
used  to  obtain  W(p) . Again,  a shift  in  pressure  compatible  with 
the  weights  used  in  Eq.  (S)  will  result  in  an  approximate  W(p)  pro- 
file. 

V.  STATISTICAL  RETRIEVALS 

Another  approach  which  we  have  explored  is  the  use  of  sta- 
tistical relationships.  If  one  writes  the  measured  radiances  as  a 
column  vector,  the  radiative  transfer  equation  may  be  written 
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(12) 


N(h^)  = 


C.  .B. 
ID  D 


where  the  B^'s  are  the  Planck  functions,  subscripts  indicate  levels 
and  C contains  the  information  about  the  distribution  of  material 
(including  the  hydrostatic  equation)  and  transmittance  of  the  gases. 
In  this  form,  C is  required  to  include  most  of  the  nonlinearity. 

This  coefficient  matrix  can  be  calculated  from  synthetic  data,  cal- 
culated in  turn  from  a wide  range  of  atmospheric  profiles.  Once 
it  is  known,  it  can  be  applied  to  real  data  to  determine  the  Planck 
function  and,  therefore,  the  temperature.  It  is  a rather  empirical 
approach,  but  it  appears  to  work  moderately  well  and  it  was  very 
fast. 


A problem  with  this  approach  is  that  by  putting  all  the 
nonlinearity  into  the  coefficient  matrix,  one  may  be  left  with 
larger  errors  than  desired.  A more  interesting  idea  to  expand  the 
true  limb  radiance  profile  as  a linear  combination  of  approximate 
(nonlinear)  radiance  profiles 

"T'fhi*  - ■=i3"A'i>hj>  - (13) 

Here  the  tactics  are  to  incorporate  the  nonlinearity  in  a known, 
interpretable  way  into  and  to  keep  D.  . linear  or  nearly  so.  A 
special  case  may  be  noted — when  = ^ij ' again  have  the  direct 

inversion,  described  earlier.  When  the  elements  of  D are  found  by 
regression,  then  we  have  a full  linearized  statistical  inversion. 

Now,  by  applying  D \ Be  is  determined.  By  knowing  p,  e and 
B are  immediately  known.  The  nonlinearity  is  now  incorporated  in 
the  e(p) . The  trick  is  to  find  the  best  way  to  incorporate  a 
large  part  of  the  nonlinearity  in  an  analytically  manageable  form 
so  that,  after  a linear  inversion,  the  nonlinear  form  may  be 
analytically  interpreted  with  small  errors. 

Some  preliminary  results  indicate  that  Eq.  (13)  does  indeed 
give  somewhat  better  results  than  Eq.  (12)  for  temperature,,  and 
considerably  better  for  ozone.  One  of  the  problems  that  tends  to 
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Fig.  6.  Preliminary  temperature  retrieval  for  Wallops  Island 
for  29  July  1975  compared  with  in-situ  measurements.  The  solid 
line  is  the  rocketsonde  plus  radiosonde  measurement ^ made  at  1803 
Greenwich  mean  time  (GMT) - The  points  are  from  two  retrievals  of 
limb  radiance  for  the  LRIR  over  pass  at  1737  GMT.  x's  and  o's  are 
for  retrievals  before  and  after  a correction  for  atmospheric  non-- 
sphericity.  (1  bar  = 100  kPa.) 

arise,  however,  is  that  although  one  can  deal  with  75%  of  the  atmo- 
spheric cases  fairly  well,  the  other  25%  (which  includes  things 
like  stratospheric  warmings  and  other  interesting  cases)  are  not 
handled  well.  It  is  very  desirable  to  develop  a technique  that  is 
general  enough  to  handle  all  these  cases  without  requiring  separate 
treatment. 
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29  July  1975 
WALLOPS  ISLAND 


OZONE  MIXING  RATIO,  PPMV 


Fig,  7.  Preliminary  ozone  retrieval  for  Wallops  Island  for 
29  July  1975 f compared  with  in-situ  measurements.  The  points  are 
based  on  a Krueger  optical  rocketsonde  launched  at  1717  GMT,  a 
Hilsenrath  chemiluminescent  rocketsonde  launched  at  1920  GMT  and  a 
balloon  ozonesonde  launched  at  2051  GMT,  The  solid  line  is  from 
the  retrieval  of  LRIR  radiances  measured  at  1737  GMT, 

(1  bar  = 100  kPa,) 
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VI . CONCLUDING  REMARKS 

As  noted  above,  many  months  of  real  data  are  now  in  the  process 
of  being  analyzed.  Much  of  the  past  year  has  been  spent  in  pro- 
cessing the  raw  data  in  order  to  have  good,  calibrated  radiances 
with  which  to  work.  At  present,  a search  is  being  conducted  for 
an  inversion  method  that  will  have  the  ability  to  extract  the  large 
amount  of  information  contained  in  the  limb  scans,  but  with  accept- 
able computer  time. 

As  an  indication  of  the  progress  to  date,  and  the  inforraation 
obtainable  by  limb  scanning.  Figs.  6 and  7 show  temperature  and 
ozone^  retrievals, respectively,  at  Wallops  Island  in  July,  1975. 

They  are  compared  with  rocket  soundings  taken  within  three  hours 
of  the  satellite  overpass.  The  temperature  profile  and  the  rocket 
are  in  reasonably  good  agreement  from  25  to  55  km.  The  root-mean- 
square  (rms)  error  is  about  3 K which  is  close  to  the  experiment 
objective.  Similarly,  the  ozone  profile  in  Fig.  7 looks  very- 
reasonable.  If  one  takes  the  percentage  difference  between  the 
ozone  retrieval  and  the  optical  rocket  measurement  at  each  level, 
and  then  takes  the  rms  value  of  that  percentage  difference,  one 
gets  14%.  This  is  close  to  the  claimed  accuracy  of  the  rocket 
instrument. 

These  results  to  date  give  us  great  confidence  that  the  infra- 
red limb  scanning  approach  is  very  powerful  and  technically 
feasible.  They  also  indicate  that  the  data  quality  from  LRIR  is 
such  that  large  amounts  of  new  data  will  be  available  to  refine 
limb  scanning  techniques  and  to  study  the  upper  atmosphere  in 
hitherto  unobtainable  detail. 


See  Footnote  1 in  Discussions. 
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SYMBOLS 


gravitational  acceleration 
true  limb  radiance 
approximate  limb  radiance 
gas  constant  for  air 
temperature 

coordinate  along  the  ray  path 
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DISCUSSIONS 


King:  As  you  Jcnow  the  geometry  of  limb  viewing  truncates  the 

kernel  which  has  the  effeict  of  imposing  the  tangent  height  as  the 
bound  of  your  intensity  integral.  This  finite  bound  converts  the 
integral  equation  from  a Fredholm  to  a Volterra  type-  Now  for 
fairly  general  classes  of  kernels  of  this  truncated  type,  the 
Volterra  equation  has  direct  solutions  which  do  not  involve  matrix 
inversions-  One  could  progressively  work  down  to  retrieve  the 
temperature,  if  the  true  atmospheric  kernel  is  sufficiently  approxi- 
mated by  a soluble  class  of  Volterra  kernels-  Have  you  tried  this? 

Gille:  Not  explicitly.  I think  one  of  the  major  problems  is, 

first  of  all,  knowing  where  you  are  starting.  If  you  have  that 
point,  then  working  up  and  down  from  there  is  what  I described  in 
the  case  of  what  we  called  the  direct  method.  If  one  already  has 
radiance  as  a function  of  pressure  (which  is  the  hard  part) , then 
working  down  would  be  the  "onion  peeling"  approach,  which  is 
equivalent  to  inverting  a diagonal  matrix.  We  have  done  a little 
work  on  this,  and  expect  to  do  more.  There  is  a problem  with  sensi- 
tivity to  noise  at  the  top  levels.  We  have  not  tried  anything  like 
an  analytic  approach  because  the  transmittances  are  not  accurately 
approximated  by  closed  form  expressions.  It  sounds  like  something 
worth  looking  into,  however. 

Chahlne:  I have  two  questions  and  one  comment-  Your  weighting 

functions  are  the  most  beautiful  weighting  functions  that  I have 
seen  today.  They  might  not  be  aesthetic  but  they  are  mathematically 
beautiful.  I would  like  to  have  seen  a comparison  between  the 
results  obtained  by  the  three  or  four  methods  you  have  described. 

If  you  tried  the  relaxation  and  the  emissivity  approach,  would  you 
get  large  variations  in  your  solution? 

Gille:  1 think  the  variations  would  be  rather  small-  I think  one 

of  the  major  differences  would  be  computer  time,  which  might  be 
rather  different  for  different  kinds  of  approaches.  We  don’t  have 
that  kind  of  comparison.  That’s  one  of  the  things  I was  alluding 
to  when  I said  in  two  or  three  months  I think  we  will  have  a good 
deal  more.  I think  many  of  these  are  now  getting  to  the  point 
where  we  can  try  them  not  only  on  synthetic  data  but  on  some  of  the 
real  data  and  see  how  well  they  do. 

Chahlne : The  second  question  is  on  the  emissivity  coefficients  you 

have  described.  And  you  have  said  they  are  fairly  independent  of 
temperature.  I am  surprised  because  in  the  stratosphere  you  can 
have  large  variations  in  temperature,  from  day  to  day  or  season  to 
season.  Were  you  able  to  determine  that  the  emissivity  coefficient 
really  was  a weak  function  of  temperature  even  for  variations  of 
20  degrees? 
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Gille:  The  first  order  effect  is  that  if  you  are  looking  at  a geo- 

metric altitude,  the  atmospheric  pressure  or  density  varies  as  the 
temperature  changes,  through  hydrostatic  adjustment.  That  is  the 
thing  you  take  out  by  plotting  versus  pressure.  Then  it  is  things 
like  the  variation  of  line  intensities  over  the  band,  and  changes 
in  the  intensities  of  overtone  bands  that  matter.  While  these 
effects  make  a difference,  they  don't  add  a lot  of  spread  to  the 
emissivities . 

Sussklnd:  Is  it  broad  banded? 

Gille:  Yes,  the  channels  are  quite  broad,  of  the  order  100-200  cm 

Mateer:  I was  a little  disappointed  that  you  didn't  show  any 

results  of  ozone  profiles,  John.^ 

Gille:  The  focus  of  this  meeting  has  been  on  methods  and  I felt  I 
should  talk  about  methods  rather  than  results,  Carl.  Also,  I was 
involved  in  a NASA  meeting  at  NCAR  until  just  before  I left,  and 
some  slides  I meant  to  bring  were  inadvertantly  left  behind.  Our 
ozone  profiles,  although  I can't  show  you  one,  look  quite  reason- 
able- We  have  results  from  about  20  km  up  to  55  km,  with  a maximum 
in  mixing  ratio  between  30-35  km  in  mid-latitudes.  We  have  a few 
cases  of  simultaneous  rocket  data  for  comparison,  but  have  only 
looked  hard  at  a comparison  from  Wallops  Island  where  there  was  a 
simultaneous  flight  by  two  rocket  ozone  sensors.  The  satellite 
and  rocket  retrievals  had  an  rms  percentage  difference  between 
20  and  50  km  of  14%.  That  is  about  what  is  claimed  for  the  accuracy 
of  the  rocket  measurements. 

Kaplan:  I am  surprised,  Carl,  that  you  haven't  asked  him  about  the 

variation  from  day  and  night  and  at  twilight  and  dawn*  Do  you  have 
any  results  on  ozone  changes? 

Gille:  The  ozone  top  at  the  moment  is  at  about  55  kilometers  and 

we  don't  see  any  extremely  large  effects  there.  There  could  be 
effects  of  perhaps  20  percent.  We're  still  checking  it  out. 

Kaplan:  Do  you  see,  or  don't  you  want  to  say  yet,  dawn  and  twi- 

light effects  at  all? 

Gille:  We  appear  not  to  see  them.  In  fact,  I can  tell  you  one 
problem  that  does  occur.  We  look  at  the  day  side,  looking  back 
not  along  the  orbit  plane  but  30*^  off  it,  north  to  south  at  a 


In  response  to  Dr.  Mateer* s comment,  the  authors  included, 
after  the  Workshop,  their  ozone  retrieval  results  (Fig.  7) . 
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given  altitude.  Going  south  on  the  night  side,  we  are  looking 
south  to  north.  Some  differences  between  day  and  night  are  due  to 
the  fact  that  the  viewing  geometry  is  slightly  different  and  one 
can  see  small  effects  of  gradients,  but  at  the  altitudes  we  are 
talking  about  those  effects  are  not  there.  By  and  large,  up  to 
55  there  does  not  seem  to  be  any  very  significant  effect.  What 
we  really  want  to  do  is  push  up  a little  bit  higher  and  I think 
the  signal  to  noise  ratio  will  allow  us  to  do  it.  I think  there 
we  ought  to  be  able  to  do  it. 
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INVERSION  OF  SCATTERED  RADIANCE  HORIZON 


PROFILES  FOR  GASEOUS  CONCENTRATIONS 
AND  AEROSOL  PARAMETERS 


Harvey  L.  Malchow  and  Cynthia  K.  Whitney 
Charles  Stark  Draper  Laboratory , Inc. 


This  paper  presents  techniques  that  have  been  developed 
and  used  to  invert  limb  scan  measurements  for  vertical  pro- 
files of  atmospheric  state  parameters.  The  parameters  which 
can  be  found  are  concentrations  of  Rayleigh  scatterers ^ ozone, 
N02f  3.nd  aerosols,  and  aerosol  physical  properties  including 
a Junge-size  distribution  parameter  and  real  and  imaginary 
parts  of  the  index  of  refraction.  The  novel  techniques  developed 
for  this  problem  should  be  of  interest  for  nonlinear  numerical 
search  problems  in  general. 


1.  INTRODUCTION 

There  is  growing  scientific  opinion  that  physical  processes 
in  the  earth's  stratosphere  are  of  vital  importance  to  man  and  the 
biosphere,  and  that  far  too  little  is  understood  about  these  proc- 
esses. Examples  include  the  interactions  and  the  resulting 
balance  between  stratospheric  ozone,  NO2  and  aerosols,  which 
affect  the  ultraviolet  radiation  and  the  temperature  environment 
at  the  Earth's  surface.  It  is  widely  appreciated  that  a proper 
understanding  of  these  phenomena  and  their  consequences  will 
require  a significant  body  of  new  experimental  data,  and  that  the 
scope  of  the  requirement  is  such  that  remote  sensing  by  satellite 
offers  the  most  practical  approach. 
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A variety  of  remote  sensing  techniques  based  on  radiometric 
data  is  available  for  stratospheric  monitoring.  It  is  possible  to 
consider  any  combination  of  extinction,  emission, and  scattering  as 
potential  signal  generating  phenomena.  Furthejonore , in  the  cases 
of  emission  and  scattering,  it  is  possible  to  consider  any  com- 
bination of  vertical,  horizontal,  or  inclined  scan  directions  for 
obtaining  the  information. 

This  paper  reports  on  new  data  processing  techniques  that 
were  developed  for  one  of  the  many  possible  stratospheric  monitoring 
techniques.  It  is  anticipated,  however,  that  some  of  the  tech- 
niques presented  here  will  be  useful  for  the  other  types  of  experi- 
ments as  well. 

In  the  problem  addressed  here,  the  measurements  consist  of 
multispectral  limb  scans  of  visible  scattered  sunlight.  Figure  1 
illustrates  the  experiment  geometry,  and  Fig.  2 illustrates  a 
typical  simulated  "measurement  data"  set.  These  measurements  are 
inverted  for  the  atmospheric  state ^ which  is  comprised  of  vertical 
profiles  of  atmospheric  parameters,  within  the  altitude  regime  of 

Incidence  Solar 

\ 

Detector 


Fig.  i.  Scattered  radiance  limb  scan  geometry. 
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Scan  Angle 


Fig,  2,  Multispectral  Limb  Scan  Measurements, 

observations-  The  parameters  are  the  concentrations  of  Rayleigh 
scatterers,  ozone,  NO^  and  aerosols,  and  the  aerosol  physical  pro- 
perties, such  as  Junge  size  distribution  parameter  (within  the 
optically  active  size  range)  and  the  real  and  imaginary  parts  of  index 
of  refraction. 

Inclusion  of  so  many  state  parameters  may  seem  ambitious,  but 
this  is  necessary  because  they  are  all  optically  active  over  the 
same  band  of  visible  wavelengths.  Therefore,  it  is  not  possible 
to  invert  for  a subset  of  these  parameters  except  by  assuming 
values  for  the  remaining  ones.  The  ability  to  treat  them  all  is, 
in  fact,  a significant  advantage  of  the  limb  scan  experiment. 

Examples  of  the  inversion  results  for  simulated  experiments  are 
presented  here  to  demonstrate  the  viability  of  the  inversion  tech- 
nique. 
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II.  GENERAL  FRAMEWORK 


The  limb  scan  inversion  problem  has  some  features  which  are 
similar  to,  and  some  which  are  different  from,  other  inversion' 
problems  in  atmospheric  sciences.  The  purpose  of  this  section  is 
to  set  a framework  for  discussing  the  limb  scan  problem  by  focus- 
sing on  those  features  which  are  shared  with  other  inversion  prob- 
lems, and  to  note  the  standard  techniques  which  are  applicable  to 
them- 

A major  similarity  between  the  limb  scan  and  other  inversion 
problems  is  the  inevitable  presence  of  noise  in  the  measurements - 
Let  represent  the  actual  atmospheric  state,  and  z(x^)  represent 
the  measurement  as  predicted  from  the  state.  The  actual  measure- 
ment observed  is 

z = z (x  ) + n 
a 

where  n represents  noise.  The  above  is  called  the  measurement 
equation-  If  linearized  about  some  x^  by  Taylor  expansion  with  the 
partial  derivative  H,  it  takes  the  same  form  as  that  which  occurs 
in  many  other  inversion  problems  where  a Fredholm  integral  is 
replaced  by  a quadrature: 

The  problem  of  extracting  the  state  x^  from  this  equation  has  been 
approached  in  various  ways  by  Deutsch  (Ref.  f) , Twomey  (Ref.  2)  , 
Mateer  (Ref.  3),  Westwater  and  Strand  (Ref.  4),  Rodgers  (Ref.  5) 
and  others. 

The  presence  of  noise  suggests  that  a stCK:hastic  approach  to 
the  problem  is  appropriate.  In  fact,  one  can  argue  that  not  only 
are  the  measurements  random  variables  due  to  the  noise,  but  also 
that  the  state  is  a random  variable  drawn  from  an  ensemble  of 
possible  states.  The  well-known  theory  of  optimal  estimation 
offers  a general  framework  for  addressing  most  such  inversion  prob- 
lems, and  it  is,  in  fact,  the  approach  adapted  here  to  the  limb 
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scan  inversion  problem.  This  common  point  of  departure  is  reviewed 
and  related  to  other  work  in  the  remainder  of  this  section. 

A way  of  defining  an  optical  state  estimate  is  suggested  by 

the  Gauss-Markov  theorem,  which  states  that  a minimum  variance, 

unbiased  estimate  of  x can  be  obtained  as  a linear  function  of 

a 

measurements  by  minimizing  a Euclidian  distance.  Various  authors 
use  various  names  for  the  construct  playing  this  role.  One  common 
name  is  "cost  function,"  expressing  the  desirability  of  minimizing 
it.  Defining  a Euclidian  distance  generally  requires  two  basic 
choices:  the  quantities  to  measure  the  distance  between,  and  the 

positive  definite  metric  to  use.  This  means  choosing  the  terms  to 
put  in  the  cost  function,  and  the  weights  to  give  them.  These 
choices  are  resolved  by  considering  what  is  done  with  the  cost 
function. 

A typical  way  to  minimize  the  cost  function  is  to  differentiate 
with  respect  to  the  state  estimate,  set  the  derivative  to  zero, 
and  solve  for  the  estimate.  To  provide  an  estimate  that  can  actu- 
ally be  evaluated,  it  is  necessary  that  the  cost  function  involve 
only  known  variables  (and  not,  for  instance,  the  unknown  actual 
state) . A well-defined  estimate  is  guaranteed  by  requiring  that 
each  term  in  the  cost  function  involve  the  deviation  between  a 
known  value  of  a quantity  and  the  known  value  that  quantity  would 
be  expected  to  have  if  the  state  were  known  to  be  equal  to  the 
estimate.  Typically,  each  such  term  is  multiplied  by  the  inverse 
of  the  variance  of  that  quantity.  That  is,  distance  is  measured 
between  known  values  and  expected  values  that  depend  on  the  state 
estimate  and  variances  define  the  metric  coefficients. 

In  the  limb  scan  problem,  there  are  many  atmospheric  param- 
eters in  the  state  and  many  wavelength  channels  and  altitudes  for 
measurement.  It  is  appropriate  to  use  vectors  for  state  and 
measurement,  and  covariance  matrices  for  expressing  uncertainties 
of  each.  With 
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= actual  vector  cjuantity 

<q>  = vector  quantity  expected  if  state  = estimate 
= quantity  covariance 

the  typical  cost  function  teirm  is 

(q  - <q>)'^  C"^  (q^  - <q>) 
a q a 

where  T indicates  transpose - 

Generally,  the  measurement  vector  contributes  the  leading 
term  in  the  cost  function.  Some  authors  include  various  additional 
terms  as  well.  Often  there  is  a term  representing  a prior  esti- 
mate of  the  state  vector.  It  should  be  noted  that  exclusion  of 
such  a tezm  is  only  a special  case;  zero  prior  estimate  and  infi- 
nite prior  covariance.  The  fact  that  the  prior  estimate  is  always 
implicitly  present  makes  the  optimal  estimation  procedure  poten- 
tially recursive.  The  optimal  state  estimate  and  its  covariance 
obtained  after  one  batch  of  data  could  be  used  as  the  prior  esti- 
mate and  covariance  before  another  batch  of  data. 

This  idea  of  recursion  leads  naturally  to  consideration  of 
cases  where  the  state  is  evolving  with  some  running  variable  v. 
(Examples  of  such  a variable  include  scattering  particle  radius, 
scattering  angle,  altitude,  longitude,  latitude,  and  time.)  When 
there  is  evolution,  it  may  be  appropriate  that  the  prior  estimate 
at  V be  determined  or  at  least  modified  by  the  posterior  estimate 
at  V - Av.  This  is  accomplished  by  some  authors  by  including  teimis 
in  the  cost  function  representing  one  or  more  derivatives  of  the 
state  with  respect  to  the  running  variable.  These  terms  have  the 
same  effect  as  modifying  the  prior  estimate  at  v to  align  better 
with  the  posterior  estimate  at  v - Av;  thus,  the  posterior  esti- 
mate at  V is  also  closer  to  that  at  v - Av,  and  the  whole  function 
is  more  smoothly  behaved.  In  the  case  of  the  limb  scan  inversion, 
the  point  of  the  experiment  is  to  find  excursions  from  rather  than 
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parameters  of  a smooth  model,  so  no  smoothing  is  involved,  and  no 
derivative  terms  appear  in  the  cost  function. 

The  following  notation  will  be  a convenient  basis  for  subse- 
quent discussion.  Let  the  vector  x be  the  state  estimate  and  the 
matrix  P be  its  covariance-  The  prior  state  estimate  is  x^  with  a 

Of  0 

covariance  P^.  Because  of  noise,  the  measurement  vector  z has  a 

O;0 

covariance  matrix  R.  If  the  state  were,  indeed,  known  to  be  equal 
to  X,  then  the  expected  values  of  x^  and  z would  be  x and  z(x). 

The  cost  function  is  thus 

T -1 

Cost  function  = (z  - z (x) ) R (z  - z(x)) 

T -1 

+ (x^  - x)  p (x  - x) 

0 0 

With  z (x)  linearized  about  x^  and  with  partial  derivative 
matrix  H,  the  optimal  state  estimate  is  found  to  be 

where 

T T -1 

K = P^  H (H  P^  H + R) 

Intuitively  reasonable  behavior  for  K is  demonstrated  by  noting 
that  for  large  noise  covariance  R,  K is  small  and  x is  largely 
determined  by  x^.  The  measurement  has  influence  only  in  proportion 
to  its  trustworthiness . 

The  gain  matrix  K is  sometimes  called  the  Kalman  gain  because 
the  procedure  being  discussed  here  is  a special  case  of  the  well- 
known  Kalman  filter  (Ref .6) . In  the  typical  Kalman  filter  problem, 
the  state  evolves  with  a running  variable  v,  and  the  prior  estimate 
Xq(v)  is  obtained  from  the  posterior  estimate  x (v  - Av)  by  using  a 
dynamic  model  for  the  evolution  over  Av.  Use  of  such  a dynamic 
model  can  have  the  effect  of  smoothing,  so  that  aspect  of  the 
Kalman  filter  formalism  has  not  been  applied  to  the  limb  scan  prob- 
lem- 
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Once  the  optimal  state  estimate  x is  determined,  its  covariance 
P follows  by  simply  substituting  the  expression  for  x into  the  defi- 
nition 

P = < (x  - X ) ^> 

^ a 

and  evaluating  the  expectations.  The  result  is 

The  covariance  update  provides  small  P whenever  R is  small.  For 
R = 0, 

T T -1 

KH  = PH  (HPH) 

Premultiplying  by 

T -1  T 
I = (H  H)  (H  H) 

Of  ^ ^ a.  Of 

and  rearranging  parentheses  establishes  K H = ^ and  = 0 in  the 
limit.  That  is,  there  is  no  uncertainty  at  all  concerning  the 
state  after  a noiseless  measurement.  The  P,  in  general,  represents 
residual  state  uncertainty  which  remains  because  the  measurement  was 
not  noise  free. 

In  the  case  of  highly  nonlinear  problems,  the  Taylor  series 
expansion  z (x)  = 2(x^)  + (^)  (x  - x^)  is  not  very  accurate  and 
as  a result,  the  posterior  state  estimate  may  not  even  approximately 
reproduce  the  observed  measurement.  The  standard  technique  for 
overcoming  such  a difficulty  is  to  make  many  iterations  on  the  same 
data,  with  the  partial  derivatives  and  the  state  estimate  (but  not 
the  covariance)  updated  at  each  iteration.  The  process  is  commonly 
called  multiple  local  iteration.  Iteration  i:^to  be  distinguished 
from  recursion,  iteration  being  applied  to  overcome  nonlinearity  and 
recursion  being  applied  to  smooth  over  noise. 

The  above  remarks  review  the  standard  aspects  of  nonlinear 
optimal  estimation  that  are  applicable  to  the  limb  scan  inversion 
problem.  There  are,  however,  a nxamber  of  features  of  the  limb  scan 
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inversion  problem  which  tend  to  distinguish  it  from  other  inversion 
problems  and  require  special  techniques.  These  are  the  subjects  of 
the  following  sections. 


III.  OVERCOMING  COMPUTATIONAL  PROBLEMS  ASSOCIATED  WITH  LIMB  SCANS 

IN  PARTICULAR 

Computational  problems  arise  in  the  limb  scan  inversion  prob- 
lem because  of  requirements  to  (1)  perform  matrix  inversions  in 
computing  the  optimal  state  estimate,  and  (2)  perform  radiative 
transfer  simulations  to  compute  expected  measurements  and  partial 
derivatives. 

The  required  matrix  inversions  are  difficult  first  because  of 
the  inherent  large  dimensionality  of  the  problem.  Typically  there 
may  be  from  ten  to  one  hundred  scan  positions,  with  several  state 
variables  and  several  measurements  per  scan  position-  Rigorously, 
each  measurement  depends  on  every  one  of  the  state  variables.  This 
is  because  at  each  scan  position,  all  the  lower  atmosphere  pro- 
vides a source  for  multiply-scattered  photons,  whereas  all  the 
upper  atmosphere  contributes  to  and  damps  fche  received  signal. 

Thus,  in  a completely  rigorous  inversion,  the  dimensionality  of 
covariance  and  partial  derivative  matrices  can  be  of  the  order  of 
hundreds  by  hundreds. 

The  solution  to  the  problem  of  large  dimensionality  lies  in 
approximations  that  replace  the  one  large  inversion,  with  its  many 
measurements  and  many  state  variables,  by  many  small  inversions, 
each  involving  just  a few  measurements  and  a few  state  variables. 

The  convenient  partitioning  is  by  scan  position;  the  state  variables 
at  the  tangent  altitude  of  a given  scan  are  found  from  the  multi- 
spectral  radiance  measurements  at  that  scan.  The  effect  of  the 
upper  atmosphere  can  be  fully  accounted  for  by  making  the  state  a 
function  of  a running  variable  v = altitude,  starting  the  inversion 
at  the  top  of  the  atmosphere,  and  working  downward  from  there,  so 
that  accurate  estimates  of  the  atmospheric  parameters  above  each 


225 


position  are  available.  The  lower  atmosphere  can  be  accounted  for 
at  least  approximately  by  using  the  prior  estimate  for  it.  The 
overall  approach  is  colloquially  called  "onion  peeling." 

The  onion  peeling  technique  described  above  is  remarkably 
efficient  because  most  of  the  signal  received  at  each  scan  position 
comes  from  near  the  tangent  altitude.  This  is  the  case  because 
the  atmosphere  is  maximally  dense  near  the  tangent  altitude,  and 
because  the  geometric  length  of  the  line  of  sight  through  altitudes 
near  tangent  is  large.  Figure  3 plots  density  of  signal  contri- 
bution against  slant  range  along  the  scan.  For  a nearly  exponen- 
tially decaying  atmosphere,  the  curve  is  nearly  Gaussian,  and  the 
width  of  that  Gaussian  corresponds  to  a small  altitude  increment 
because  of  the  limb  scan  geometry. 

Even  with  the  dimensionality  of  the  problem  reduced  by  the 
onion  peeling  described  above,  there  is  still  a potential  diffi- 
culty due  to  the  dynamic  range  of  the  variables.  It  is  possible, 
in  particular,  for  the  diagonal  entries  of  covariance  matrices  to 
be  of  significantly  different  magnitudes,  the  result  being  that 
the  required  matrix  inversions  are  numerically  difficult.  This 
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Fig,  3.  Scan  through  Earth’s  Atmosphere 
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difficulty  is,  however,  easily  circumvented  by  redefining  the 
variables.  We  let  the  measurement  be  log  (radiance)  instead  of 
radiance,  and  correspondingly  we  let  the  state  elements  be  log 
(constituent  density)  or  log  (aerosol  physical  property) , as  the 
case  may  be. 

Radiative  transfer  simulations  are  required  in  the  limb  scan 
inversion  problem  to  produce  the  predicted  measurement  z (x)  and  the 
partial  derivative  matrix  H.  A first  problem  to  be  faced  in  per- 
forming a simulation  is  that  the  aerosol  physical  properties  (Junge 
size  parameter  and  complex  refractive  index)  enter  the  radiative 
transfer  problem  only  indirectly,  through  the  resulting  optical 
properties  (phase  functions  and  cross  sections) . In  an  inversion 
procedure,  it  is  not  practical  to  take  a conventional  approach, 
calculating  the  aerosol  optical  properties  from  the  physical  pro- 
perties with  a Mie  code.  Instead,  a precalculated  aerosol  model 
can  be  used-  The  model  used  here  consists  of  polynomial  coeffi- 
cients obtained  from  multivariate  regression  of  a large  data  base 
of  aerosol  optical  properties  calculated  from  physical  properties 
by  a Mie  code.  In  milliseconds  of  computation,  the  resultant  model 
yields  values  of  phase  function,  scattering  cross  section  and 
absorption  cross  section,  given  Junge-size  parameter  and  complex 
index  of  refraction. 

For  the  limb  scan  situation,  the  calculation  of  measurement 
given  optical  state  is  still  a nontrivial  problem.  The  process  is 
governed  by  a complicated  integro-dif ferential  equation  of  radia- 
tive transfer  requiring  elaborate  computer  simulation.  The  curved 
geometry  inherent  in  the  problem  restricts  the  niimber  of  applicable 
computer  models,  and  at  present  only  two  accurate  techniques  are 
available:  Monte  Carlo  simulation  as  described  by  Collins  and  Wells 
(Ref.  7)  and  DART  simulation  as  described  by  Whitney  (Ref.  8). 

Other  than  these,  there  is  only  a simple  but  unrealistic  single 
scattering  approximation. 
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The  choice  of  which  of  the  available  radiative  transfer  models 
to  use  is  dictated  by  speed  requirements.  In  the  course  of  a limb 
scan  inversion,  the  radiative  transfer  model  is  called  many  thou- 
sands of  times.  The  number  of  calls  grows  multiplicatively  with 
the  number  of  wavelengths,  number  of  state  variables,  number  of 
iterations,  and  niomber  of  scan  positions,  and  typically  approaches 
10  . Therefore,  only  the  single  scattering  and  DART  models  are 
really  practical  to  use. 

Although  the  computational  difficulties  associated  with  limb 
scan  inversion  are  largely  circumvented  by  the  steps  described 
above,  there  remain  difficulties  associated  with  nonlinearity. 

These  are  of  a fundamental  nature  not  particularly  limited  to  the 
limb  scan  problem.  Their  solution  is  described  in  the  next  section. 

IV.  ESTIMATING  PARTIAL  DERIVATIVES  FOR  NONLINEAR  INVERSIONS  IN 

GENERAL 

It  was  found  that  even  the  combined  application  of  all  the 
techniques  described  in  the  last  section  was  insufficient  to  pro- 
duce a scattered  limb  scan  inversion.  The  reason  is  the  extreme 
nonlinearity  of  the  radiative  transfer  over  the  dynamic  range  of 
the  state  variables  normally  encountered.  Because  nonlinearity  is 
a characteristic  of  many  inversion  problems,  and  not  just  the  limb 
scan,  the  technique  for  overcoming  it  presented  here  may  be  fruit- 
fully applied  to  other  nonlinear  problems  as  Well. 

The  heart  of  the  problem  is  in  the  calculation  of  the  partial 
derivative  matrix  H.  The  radiative  transfer  is  sufficiently  non- 
linear over  the  state  variable  excursions  required  that  the  usual 
idea  of  partial  derivative  is  not  suitable.  The  local  slope  of  a 
curve  of  radiance  against  state  variable  simply  does  not  follow 
the  curve  far  enough  to  be  used  as  an  accurate  basis  for  updating 
the  state. 

Given  the  nonlinearities  of  the  problem,  we  are  likely  to 
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encounter  the  situations  illustrated  in  Figs.  4a  and  4b.  The 
radiance  as  a function  of  number  density  of  aerosols  is  plotted 
as  a sigmoid  curve.  The  left  asymptote  is  found  at  aerosol  densi- 
ties so  low  that  the  radiance  is  entirely  determined  by  the  other 
atmospheric  constituents.  The  right  asymptote  is  found  at  aerosol 
densities  so  high  that  the  atmosphere  is  essentially  opaque.  A 
measured  value  of  radiance  is  indicated  by  the  dashed  lines  in  the 
figures.  In  Fig.  4a,  the  prior  estimate  of  the  state  is  in  a 
region  where  the  local  slope  is  so  small  that  the  updated  state 
estimate  resulting  from  that  slope  is  vastly  larger  than  the  actual 
state.  In  Fig.  4b,  the  prior  estimate  is  in  a region  where  the 


(a)  . Overshoot  due  to  partial  derivative. 


(b) . Undershoot  due  to  partial  derivative. 

Fig.  4.  Radiance  as  a function  of  number  density  of  aerosols. 
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local  slope  is  so  large  that  the  updated  state  estimate  resulting 
from  that  slope  is  much  too  close  to  the  prior  estimate.  It  is 
possible  for  the  overshoot,  in  the  case  of  Fig.  4a,  to  throw  the 
problem  outside  the  niimerical  operating  range  of  the  computer,  and 
it  is  possible  for  the  undershoot,  in  the  case  of  Fig.  4b  to  inter- 
minably delay  convergence,  in  either  case  making  the  problem 
unsolvable  in  a practical  sense. 

The  remedy  for  the  problems  described  is  to  use  a derivative- 
like construct  that  is  more  global  in  its  meaning  than  the  local 
partial  derivative.  Such  a construct  is  determined  by  first  formu- 
lating a mathematical  model  for  the  curves  of  radiance  as  a function 
of  the  state  parameter.  To  acknowledge  the  changing  slope,  the 
radiance  is  modeled  as  a quadratic  function  of  the  state  parameter. 
Figure  5 illustrates  how  such  a model  can  be  fairly  accurate  over  a 
region  substantial  enough  to  include  the  required  state  excursions. 
The  quadratic  model  is  actually  constructed  by  the  following  steps; 

1.  Establish  the  prior  estimate  of  radiance,  based  on  the 
state  prior  estimate. 

2.  Perturb  the  state  ± one  standard  derivation,  and  calculate 

/ 


State  Actual 

Prior  State 

Estimate 


Fig.  5.  Quadratic  fit. 


the  perturbed  radiances. 


3.  Fit  a quadratic  curve  through  the  three  points. 
Thus,  the  model  is: 

2 

z.  = a + b(x.  - X ) + c(x.  - X ) 

1 12  ' 1 2 

where  i = 1,  2,  3,  and  the  coefficients  are  found  as: 


(Zg  - Z^)  (X^ 


X2>  (X2 


Xj^)  + (X2  - X^)  (Xg 


The  quadratic  model  is  the  basis  for  evaluating  a derivative 
like  construct  that  is  more  global  in  its  meaning  then  the  local 
partial  derivative  is.  The  construct  is  evaluated  by  the  following 
algorithm. 


1.  If  the  measurement  line  intersects  the  quadratic  model 
curve,  replace  the  partial  derivative  by  the  slope  of  the  line 
from  prior  estimate  point  to  the  nearest  intersection. 

2.  If  the  measurement  line  does  not  intersect  the  quadratic 
model  cuirve,  replace  the  partial  derivative  by  the  slope  of  the 
line  from  prior  estimate  point  to  the  extremum  of  the  quadratic. 


V.  SIMULATION  RESULTS 

This  section  presents  inversion  results  for  a simulated  limb 
scan  experiment.  In  the  simulation,  a set  of  limb  radiances  is 
calculated  from  a known  atmospheric  state.  These  radiances  are 
treated  as  "measurements**  and  inverted  to  retrieve  the  known  state. 

Seven  state  parameters  are  inverted,  namely:  (1)  concentration 

of  Rayleigh  scatterers;  (2)  concentration  of  ozone;  (3)  concentration 
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of  nitrogen  dioxide;  (4)  aerosol  extinction;  (5)  aerosol  Junge- 
size  parameter;  (6)  aerosol  refractive  index,  real  part;  and 
(7)  aerosol  refractive  index,  imaginary  part.  Eight  wavelengths 
are  used  in  the  inversion:  0.3500,  0.4000,  0.4500,  0.5000,  0.5500, 
0.6750,  0.7770,  and  0.8630  ym.  These  were  chosen  to  fall  within 
the  sensitive  spectral  range  of  a silicon  diode  detector,  and  at 
the  same  time  to  involve  only  the  chosen  parameters  as  optically 
active  constituents.  The  specific  problem  geometry  has  the  scan 
direction.  Earth  centroid,  and  satellite  lying  in  the  same  plane 
with  the  Sun  nearly  behind  the  detector  so  that  the  scattering 
angle  is  approximately  164^.  For  aerosol  inversion,  this  is  a 
moderately  good  geometry,  in  that  the  aerosol  angular  scattering 
function  is  roughly  midway  between  its  highest  and  lowest  values. 

For  each  state  parameter,  three  graphs  have  been  constructed. 
The  first  displays  the  state  parameter  as  a function  of  altitude, 
the  second  shows  the  evolution  of  the  inverted  state  parameter  as 
the  inversion  process  is  recycled  at  a particular  scan  altitude, 
and  the  third  displays  error  bars  for  the  completed  inversion. 

The  particular  graphs  presented  were  constructed  with  input 
noise  statistics,  but  without  actual  noise  values.  Ideally,  they 
should  be  representative  of  mean  performance  with  zero  mean  noise, 
and  the  error  bars  should  describe  the  spread  to  be  expected. 

Later  runs  with  actual  random  noise  included  required  an  artificial 
scaling  down  of  the  gain  by  a factor  of  5 to  produce  the  anticipated 
behavior. 

The  atmospheric  state  is  defined  at  altitude  intervals  of  1 
kilometer,  and  three  varieties  of  state  are  refhresented  in  the 
first  graph.  These  are: 

1.  Prior  Estimate  State — The  first  guess,  constructed  from 
various  standard  sources  which  are  compiled  in  Malchow  (Ref.  9) . 

This  state  is  represented  by  relatively  smooth  functions  of  alti- 
tude. 
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2.  True  State — This  is  the  state  that  is  used  to  produce  the 
simulated  measurements.  It  is  constructed  from  various  sources  of 
real  data,  except  for  the  aerosol  parameters,  which  are  chosen  by 
random  selection  from  what  is  considered  a likely  range  of  varia- 
tion. 

3.  Inverted  State — This  is  the  state  found  by  the  inversion 
process.  Since  no  noise  has  been  added  to  the  simulated  measure- 
ments, this  state  is  identical  to  the  true  state  when  the  inversion 
is  perfect. 

The  scan  range  extends  from  22  to  13  kilometers.  For  22  kilometers 
and  above,  the  true  state  and  prior  estimate  state  are  set  equal 
in  order  to  focus  attention  on  the  inversion  process  itself  and  not 
the  consequences  of  initial  errors.  It  has  been  found  in  other 
simulations  that  such  initial  errors  can  be  overcome  in  approxi- 
mately three  scans. 

The  second  graph  of  each  set  shows  the  path  followed  by  each 
parameter  through  state  space  en  route  to  convergence.  Since  the 
limb  problem  is  highly  nonlinear,  many  iterations  of  the  inversion 
procedure  are  carried  out  at  each  scan  position.  The  convergence 
curve  represents  only  the  20  kilometer  scan  altitude.  For  this 
particular  altitude  it  can  be  seen  from  the  curves  that  a stable 
solution  is  achieved  in  10  iterations.  However,  more  iterations 
are  needed  at  the  higher  altitudes. 

The  third  graph  of  each  set  shows  the  la  error  bars.  The 
error  bar  plotted  at  23  kilometers  represents  the  covariance  input 
to  the  problem.  The  error  bars  at  lower  altitudes  are  generally 
smaller,  having  been  reduced  by  the  information  gained  in  the 
inversion  process.  In  cases  where  even  further  reduction  is 
desired,  avaraging  over  data  sets  can  be  employed. 

The  first  set  of  data  (Figs.  6,  7,  and  8)  is  for  Rayleigh  scat- 
terers  (also  called  neutral  density) . Since  this  constituent  has 
little  variability  relative  to  its  range  with  altitude,  the  prior 
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Fig.  6.  Neutral  density 
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yolution. 
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estimate  is  plotted  separately  from  the  inverted  and-  true  states. 

The  first  graph  (Fig.  6)  shows  a good  tracking  of  the  true  state 
by  the  estimated  state.  At  20  kilometers  the  true  and  prior  esti- 
mate states  differ  by  3%  initially.  This  is  representative  of  a 
la  excursion.  After  8 iterations,  the  second  graph  (Fig.  7)  shows 
close  convergence  to  the  true  state.  Error  bar  results  for  this 
constituent  (Fig.  8)  show  a small  reduction  of  initial  input 
uncertainty . 

The  second  set  of  data  (Pigs.  9,  10,  and  11)  is  concerned  with 
ozone  concentrations.  The  true  ozone  state  (based  on  data  from 
Ref.  10) , is  marked  by  strong  perturbations  both  up  and  down  from 
the  prior  estimate.  However,  ozone  is  strongly  represented  in  the 
simulated  measurements  at  the  chosen  scan  altitudes,  with  the 
result  that  this  constituent  can  be  inverted  quite  accurately  as 
the  inverted  and  true  states  show.  At  20  kilometers,  the  initial 
perturbation  was  about  -50%.  The  initial  uncertainty  assumed  for 
ozone  is  la  = 60%,  and  the  inversion  reduces  this  by  a factor  of 
3 or  4. 

The  next  set  of  graphs  (Figs.  12,  13,  and  14)  is  concerned  with 
NO^.  This  constituent  has  relatively  small  prior  estimate  (Ref. 

11)  optical  depths  for  the  chosen  scan  altitude  regime  and  the 
true  state  (based  on  our  SKYLAB  measurements)  is  even  smaller. 
Therefore,  one  expects  and  gets  less  accuracy  for  NO^  in  the 
inversion  than  for  ozone.  The  error  bars  show  a modest  reduction 
from  the  assumed  initial  uncertainty  of  la  = 300%. 

Aerosol  extinction  is  the  siibject  of  the  next  set  of  graphs 
(Figs.  15,  16,  and  17)  , In  this  case,  the  prior  estimate  curve  is 
based  on  Elterman's  data  (Ref.  12),  The  true  state  is  constructed 
from  lidar  data  (Ref.  13),  and  shows  layered  structure.  Since 
Since  aerosol  extinction  exhibits  a great  deal  of  spatial  dynamics, 
it  is  important  that  the  inversion  process  can  deal  with  a large 
range  of  extinction  values.  This  particular  inversion  problem  shows 
that  order  of  magnitude  changes  in  aerosol  extinction  over  a 1 
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Fig-  13,  Nitrogen  dioxide  convergence 
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kilometer  interval  can  be  dealt  with  by  the  nonlinear  algorithm. 

The  error  bars  for  aerosol  extinction  show  a reduction  of  the 
initial  order  of  magnitude  uncertainty  by  as  much  as  a factor  of 
50. 

The  final  three  sets  of  graphs  (Figs.  18  to  26)  are  con- 
cerned with  inversion  of  aerosol  physical  parameters.  A note- 
worthy feature  of  these  graphs  is  that  the  inversion  accuracy  for 
these  parameters  is  proportional  to  the  aerosol  extinction.  In 
each  case,  the  closest  agreement  between  the  true  and  inverted 
states  occurs  at  the  peaks  in  aerosol  extinction.  Graphs  of 
aerosol  extinction  have  been  superposed  on  the  error  bar  graphs  to 
illustrate  this  point.  The  results  show  th*-.t  when  moderate  aerosol 
concentrations  are  present,  useful  information  about  the  aerosol 
physical  characteristics  can  be  obtained  from  limb  scans. 

In  general,  the  simulated  measurement  calculations  displayed 
here  demonstrate  that  sufficient  information  is  present  in  the  con- 
sidered wavelength  set  to  separate  and  invert  the  studied 
constituents.  An  important  next  step  in  the  simulation  studies 
will  be  to  relate  the  expected  inversion  accuracy  to  the  quality 
of  the  measurements.  Other  potential  error  sources  that  will 
require  analysis  include  albedo  uncertainties,  modeling  errors, 
such  as  those  from  quadrature  in  the  radiative  transfer  calcu- 
lations, and  errors  in  the  constituent  cross-section  averages 
over  finite  wavelength  intervals. 
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Fig.  18.  Junge  size  parameter  solution. 
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Fig.  20. 
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Fig.  22.  Aerosol  real  refractive  index  convergence. 
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Fig.  23.  Aerosol  real  refractive  index  error  bars. 
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imaginary  index 
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Fig.  24.  Aerosol  imaginary  refractive  index  solution. 
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Fig.  25.  Aerosol  imaginery  refractive  index  convergence. 
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VI.  CONCLUDING  REMARKS 


This  paper  has  demons txated  the  feasibility  of  inverting 
multispectral  horizon  profile  radiance  data  at  altitude  inter- 
vals of  1 kilometer  for  number  densities  of  Rayleigh  scatterers^ 
ozone,  aerosols,  and  for  aerosol  physical  properties 

including  a size  distribution  parameter  and  the  con^lex  index  of 
refraction.  This  is  the  first  demonstration  of  a remote  sensing 
technique  that  would  provide  all  this  information  simultaneously 
on  a global  basis  and  with  all  the  correlations. 

The  inversion  technique  itself  extends  standard  nonlinear 
estimation  theory  in  order  to  accommodate  the  limb  scan  problem. 

A crucial  aspect  of  the  extension  is  the  use  of  a derivative-like 
construct  which  is  more  global  in  its  meaning  than  the  local 
partial  derivative.  Nonlinear  estimation  theory  extended  in  this 
way  apparently  constitutes  the  first  numerical  seetrch  technique 
based  on  a construct  other  than  the  local  partial  derivative. 

The  multi-spectral  horizon  profile  remote  sensing  technique 
and  the  extended  nonlinear  inversion  technique  together  imply  data 
requirements  and,  hence,  specifications  for  a high  quality  radio- 
meter. In  conjunction  with  the  analysis  and  simulation  effort 
reported  here,  Charles  Stark  Draper  Laboratory  has  designed  an 
instrument  specifically  to  meet  the  implied  requirements. 


SYMBOLS 


a scalar  coefficient 

b scalar  coefficient 

c scalar  coefficient 

C covaricuice  of  q 

q 

D determinant 

]g  partial  derivative  matrix 

I identity  matrix 
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gain  matrix 
noise 

state  covariance 

prior  estimate  of  state  covariance 
quantity 

actual  value  of  quantity 

expected  value  of  quantity 

noise  covariance 
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running  variable 
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atmospheric  state  estimate 

actual  state 

prior  estimate  of  state 

radiance  measurement 

wavelength 

standard  deviation 
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DISCUSSION 


IVitty:  First,  your  results  are  really  incredible.  I missed  some 

where  just  what  went  into  this.  What  is  your  input  from  which  you 
are  extracting  all  of  these  parameters?  What  spectral  inputs  and 
what  scattering  information? 

Malchow:  We  are  assuming  that  there  is  a multi-channel  photometer 

that  has  eight  wavelength  channels  of  information  ranging  from 
4000  to  7000  k.  The  band  width  of  each  channel  is  rather  narrow, 
something  like  20  or  50  A.  It  depends  somewhat  on  the  instrument 
design. 

Chahine:  What  are  you  measuring? 

Malchow:  We  are  doing  scans  of  the  sunlit  limb. 

Twitty:  So  this  is  through  the  limb  with  the  Sun  in  one  specific 

place? 

Whitney:  It  doesn't  matter  where  the  Sun  is.  Sun  can  be  anywhere 

Twitty;  For  the  results  you  show  here,  the  Sun  was  way  overhead? 

Malchow:  It  was  somewhat  behind  and  off  to  the  side. 

Whitney;  It  happened  to  be  what  was  in  that  Sky lab  experiment. 

We  constructed  the  geometry  accordingly. 

Gille:  Do  you  have  any  data  from  SkylcdD  or  other  similar  kind  of 

experiments  that  you  can  test  this  on? 

Malchow:  We  have  Skylab  data  but  it  is  really  bad.  First  of  all, 

the  channels  are  not  calibrated  relative  to  one  another  because 
there  was  a rush  to  get  the  instrument  onto  th€^  spacecraft.  They 
were  calibrating  it  at  Woods  Hole  or  somewhere  when  the  calibration 
light  bulb  burned  out.  They  could  not  get  another  one  before  the 
instrument  had  to  go  on  the  spacecraft.  So  they  did  try  to  cali- 
brate it  but  with  about  10%  uncertainty  in  one  channel  or  another. 
We  cauinot  really  work  with  that  much  noise.  If  the  noise  levels 
were  lower,  we  could  probeibly  fit  it  to  a clean  atmosphere  at 
higher  altitudes,  but  the  present  noise  levels  cure  just  too  large. 

Herman:  I think  what  is  puzzling  some  people  here  is  the  question 

of  whether  these  were  simulated  measurements  or  not.  They  were 
simulated,  am  I correct? 

Malchow:  Yes. 
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Herman:  Now  to  do  this,  apparently  you  must  have  performed  cal- 

culations using,  let's  say,  a Junge  size  distribution? 

Malchow:  Yes. 

Herman:  Now  suppose  the  actual  size  distribution  is  non-Junge. 

That  would  throw  everything  off,  am  I correct? 

Malchow:  Yes,  indeed. 

Herman:  Now  the  other  thing  is  you  were  able  to  extract  more  this 

way  than  we  can  from  ground-based  measurements  and  I am  trying  to 
figure  out  why.  There  must  be  something  in  your  computer  simu- 
lation that  is  enabling  you  to  reproduce  all  these  various  variables 
more  accurately  than  can  be  achieved  in  practice. 

Malchow:  I think  one  of  the  advantages  over  the  ground-based  data 

problem  is  that  in  the  stratosphere  we  are  not  looking  through  as 
much  atmosphere  as  on  the  way  in  or  out.  So  we're  getting  more 
signal  from  the  particular  layer  that  we  are  looking  at  relative 
to  the  total  signal  that  we  are  getting.  As  to  the  question  of 
whether  the  Junge -size  parameter  is  relevant  or  not,  in  any  of 
these  inversion  processes  you  find  what  you  are  looking  for.  If 
you  put  in  six  parameters  to  describe  the  state,  you  find  six 
parameters.  All  we  have  done  is  an  exercise  with  the  simplest 
analytic  model  that  gave  us  some  kind  of  infoimiation  about  the 
aerosol  size  distribution. 

Unidentified  Speaker:  Where  are  the  errors  involved? 

Whitney : There  were  error  statistics  in  the  measurement,  which 

did  not  actually  perturb  the  measurements.  That  would  be  the 
desirable  thing  to  do,  namely,  to  run  through  that  exercise.- 

Malchow:  Yes,  all  we  have  shown  is  a closed  problem  in  effect. 

Reagan:  To  amplify  a little  bit  on  what  Dr.  Herman  was  talking 
about,  I am  curious  about  the  globality  or  the  global  ability  for 
the  optimization  to  home  in  and  along  with  that  I would  like  to 
ask  a question  about  the  sequence  that  you  went  about  for  say 
determining  refractive  index  and  size.  Was  it  sequential?  Was 
it  simultaneous? 

Whitney:  Yes,  simultaneous. 


Post-Workshop  comments  added  by  the  authors — "We  were  able 
to  do  that  soon  after  the  Workshop.  We  found  that  preserving  the 
stability  required  cutting  the  gain  by  a factor  of  five." 
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Reagan:  Also,  if  you  changed  things  a bit  and  you  still  came  up 

with  the  same  results,  that  would,  it  seems,  say  something  about 
the  global ity  of  it. 

Irvine:  You  have  assumed  that  you  have  values  of  the  intensity 

measured  in  a two-dimensional  altitude  and  frequency  plane.  How 
much  do  the  parameters  that  you  are  looking  for  influence  the 
intensity  in  different  portions  of  that  plane  so  that  you  get 
some  separation  there? 

Nhitney:  Well,  for  instance,  the  NO2  tends  to  be  in  a layer  that 

is  at  a different  altitude  than  the  ozone,  and  aerosols  have  their 
own  characteristics  with  altitudes. 

Jrvine:  And  there  is  different  wavelength  dependence  too? 

Whitney:  Correct.  The  various  constituents  are  active  at  dif- 

ferent wavelengths.  We  have  not  yet  conducted  a channel  optimiza- 
tion and  that  is  an  exercise  that  must  be  done.  I believe  that 
the  only  basis  for  working  with  those  particular  wavelengths  shown 
here  is  that  they  correspond  to  what  was  in  the  Skylab  data. 

Irvine:  Can  I ask  you  something  else?  What  percentage  abundance 

of  NO^  can  you  hopefully  detect?  What  mixing  ratio? 

Malchow:  As  you  saw  in  our  results,  we  are  getting  into  diffi- 
culties with  the  levels  that  we  used  in  this  experiment.  The 
mmibers  that  appear  on  those  graphs  are  getting  down  to  about  the 
lower  limit  of  what  you  can  deal  with. 

Irvine:  And  was  that,  something  like  one  part  in  10^^? 

Malchow:  I don't  remember  what  the  mixing  ratio  was.  I can  show 

you  what  the  actual  density  values  were.  But  I would  say  that  for 
nominal  NO2  it  is  probably  shaky  in  the  altitude  region  where  we 
did  the  inversion. 

Whitney:  That's  pretty  far  away  from  where  it  is  peaked. 

Irvine:  Does  one  have  any  hope  of  detecting  some  of  the  compounds 

that  are  related  to  the  ozone  abundance — the  fluorocarbons  and  the 
do  and  things  like  that? 

Whitney:  I tend  to  be  doubtful  with  the  wavelengths  we  used 

because  there  is  a limit  to  how  many  independent  wavelengths  there 
really  are.  And  you  cannot  look  for  more  bits  of  information  than 
are  really  there. 

Shettle:  I have  a question  regarding  the  relation  of  your  error 

bars  to  the  accuracy  of  your  convergence.  Your  fitted  values  seem 
to  converge  within  a small  fraction  of  the  error  bars  you  show. 
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Whitney:  That  is  because  we  did  not  actually  perturb  the  measure- 

ment values.  We  simply  processed  the  statistics.  We  did  not  do 
a Monte  Carlo  type  of  problem  where  you  would  actually  put  the 
perturbations  on  in  accordance  with  those  statistics. 

Thomas:  I feel  that  the  quality  of  your  results  in  regard  to  the 

aerosols  might  arise  from  the  fact  that  you  only  had  one  parameter 
in  the  size  distribution.  We  have  taken  a look  at  the  possible 
inversion  of  all  of  the  matrix  elements,  using  a three -parameter 
model,  cind  we  found  three  orders  of  magnitude  separation  between 
the  first  and  second  eigenvalues.  So  limiting  the  model  to  one 
parameter  might  be  of  great  value  to  you. 

Malchow:  Yes,  I think  it  was.  Two  parameters  would  be  really 

tough,  and  more  parameters  than  that  I cannot  really  see  being 
handled  easily. 


Editorial  Footnote:  The  following  statement  was  submitted  by  the 

authors  soon  after  the  Workshop — "Some  of  the  questioners  of  our 
results  expressed  concern  that  we  had  not  produced  simulations 
that  included  random  noise  inputs.  Within  the  week  following  the 
meeting,  we  did  successfully  run  simulations  with  additive  random 
noise,  and  we  obtained  results  that  were  quite  satisfactory.  To 
produce  these  results,  we  found  it  necessary  to  introduce  a small 
amount  of  gain  damping - " 


263 


INVERSION  OF  SOLAR  AUREOLE  MEASUREMENTS 


FOR  DETERMINING  AEROSOL 
CliARACTERISTICS 

Adarsh  Deepak 
Old  Dominion  University 
and 

Institute  for  Atmospheric  Optics  and 
Remote  Sensing  (IFAORS) 


Solar  aureole  is  the  region  of  enhanced  sky  brightness 
within  about  20^  around  the  Sun's  disk,  mainly  because  of  the 
predominantly  forward  scattering  of  aerosol  particles . It  is 
shown  that  the  solar  aureole  radiance  is  very  sensitively 
dependent  on  the  aerosol  size  distributions.  The  photographic 
solar  aureole  isophote  (PSAI)  measurement  technique  for  deter- 
mining the  aerosol  size  distribution  n(r)  and  other  charac- 
teristics takes  advantage  of  this  sensitivity . Single  scat- 
tering theory  of  the  solar  aureole  is  given.  The  assumptions 
and  conditions  imposed  on  the  single  scattering  theory  to 
make  it  tractable  to  inversion  are  discussed.  The  important 
role  of  the  almucantar  measurements  is  also  discussed . Efforts 
that  need  to  be  performed  in  the  near  future  are  also  stated. 


I . INTRODUCTION 

As  man  makes  more  and  more  of  an  impact  on  his  environment 
because  of  the  rapidly  expanding  technology,  it  becomes  increasingly 
imperative  to  study  the  background  level  of  these  aerosols  in 
order  to  monitor  how  man  is  affecting  the  balance  in  the  atmosphere 
and  what  effects  these  aerosols  will  have  on  such  things  as  cli- 
mate, air  quality,  solar  radiation  dosage,  man's  health,  food  pro- 
duction, etc. 
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Remote  sensing  technifjues  based  on  extinction  and  scattering 
of  electromagnetic  radiation  by  aerosols  are  perhaps  the  most  prac- 
tical and  economical  way  of  diagnosing  and  monitoring  the  atmo- 
spheric aerosols  on  a long-term  basis.  The  aerosol  characteristics 
of  special  interest  are:  the  size  distribution  n(r),  the  complex 
refractive  index  ffi  = m*  - im" , and  the  altitude  distribution  of 
these  quantities.  Of  these,  the  size  distribution  perhaps  plays 
the  most  important  role  in  electromagnetic  scattering  phenomena 
and  atmospheric  processes.  It  is  assumed  for  the  sake  of  sim- 
plicity that  the  aerosols  are  spherical,  r being  the  radius. 

Nearly  all  the  aerosol  remote  sensing  methods,  active  and 
passive,  are  based  on  the  measurement  of  extinction,  scattered 

intensity,  or  polarization  of  the  direct,  backscattered , or  multi- 
angle scattered  radiation  made  multi- spectrally  or  through  narrow 
bandpass  spectral  filters.  It  is  by  inverting  the  measurements 
obtained  by  one  or  a combination  of  these  methods  that  the  aerosol 
characteristics  are  usually  determined.  In  this  paper,  simple  and 
practical  methods  based  on  the  measurement  and  inversion  of  the 
solar  aureole  radiance  are  described. 

II.  SOLAR  AUREOLE  MEASUREMENTS 

The  solar  aureole  is  the  area  of  enhanced  brightness  closely 
surrounding  the  Sun*s  disk  (within  about  20^)  because  of  mostly 
aerosol  scattering  of  sunlight.  Since  the  aerosols  scatter  pre- 
dominantly in  the  forward  direction,  the  contribution  of  atmo- 
spheric haze  to  the  sky  radiance  for  angles  close  to  the  Sun  is 
roughly  10^  to  10^  times  the  contribution  by  molecular  scattering. 
This  is  illustrated  in  Fig.  1.  It  is  to  take  advantage  of  this 
large  signal  range  that  a simple,  portable,  photographic  solar 
aureole  measurement  (PSAM)  technique  was  developed  at  the 
University  of  Florida  in  1970  (Ref.  1)  and  has  since  been  used  to 
diagnose  the  aerosol  size-altitude  distributions  (Refs.  2 to  5)  by 
using  the  aureole  radiance  measurements  along  the  almucantar. 
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The  almucantar  is  a conical  scan  of  constant  solar  zenith  angle 
with  the  local  zenith  as  the  axis.  A solar  aureole  measiirement 
program  was  subsequently  initiated  at  the  NASA  Langley  Research  Center 
Center  in  1974  and  the  photographic  solar  aureole  isophote  (PSAI) 
method  (Refs.  6 and  7)  was  developed.  Isophotes  are  lines  or 
curves  of  equal  radiance.  The  PSAI  method  is  an  extension  of  the 
almucantar  solar  aureole  radiance  (ASAR)  method.  In  the  latter, 
the  radiance  measurements  taken  along  the  almucantar  are  used  to 
infer  the  aerosol  properties.  In  the  PSAI  method,  in  addition  to 
making  the  almucantar  measurements,  one  takes  advantage  of  the 
fact,  which  emerged  from  our  computer  studies,  that  the  shape  of 
the  solar  aureole  isophotes  is  sensitively  dependent  on  the  charac- 
teristics of  the  aerosol  size  distribution.  Suggestions  for  the 
use  of  solar  aureole  measurements  to  determine  aerosol  properties 
had  also  been  made  earlier  by  Deirmendjian  (Refs.  8 and  9)  and 
other  researchers  (Refs.  10  to  13) . Solar  aureole  measurements, 
taken  with  scanning  photometers  for  determining  aerosol  properties 
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Fig.  1,  Relative  intensities  of  the  sky  brightness  as  com- 
pared to  that  of  the  Sun’s  direct  light,  r represents  the 
distance  from  the  Sun  in  solar  radii.  (From  Ref.  14.) 
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have  also  been  made  by  Shaw  (Ref.  15)  and  Twitty,  et  al.  (Ref.  16)  . 

The  following  sections  briefly  describe  the  theory,  photo- 
graphic measurements,  and  results  of  the  PSAI  method.  For  details 
of  some  of  the  theoretical,  aspects  discussed  here,  see  Ref.  1. 

III.  SIMPLIFYING  ASSUMPTIONS 

The  forward  scattered  radiance,  being  highly  sensitive  to 
n(r) , is  relatively  insensitive  to  effects  due  to  aerosol  refrac- 
tive index,  polarization,  and  multiple  scattering  (MS) , a fact 
that  helps  in  simplifying  the  inversion  problem.  In  this  paper, 
only  the  single  scattering  (SS)  theory  treatment  is  considered 
which  should  help  in  understanding  the  difficulties  involved  in 
the  inversion  of  aerosol  scattering  measurements.  Therefore,  one 
makes  the  following  reasonable  simplifying  assumptions: 

(1)  Particles  are  spherical  so  that  results  of  the  Mie 
theory  can  be  used  in  computations. 

(2)  The  atmosphere  is  horizontally  homogeneous  and  vertically 
inhomogeneous . 

(3)  Absorption  effects  are  ignored  by  selecting  to  work  in 
spectral  regions  for  which  atmospheric  absorption  is  nil. 

(4)  The  polarization  effects  are  small  for  forward  scattered 
light  and  can  be  ignored. 

(5)  For  relatively  clear  days  (visibility  > 15  km) , the  MS 
effects  at  the  forward  scattering  angles  are  small  compared  with 
SS  (Ref.  4)  and  can  be  ignored. 

(6)  An  average  value  for  the  refractive  index  of  all  atmo- 
spheric aerosols  is  assumed  for  forward  scattea^dng. 

(7)  The  atmosphere  is  treated  as  plane-parallel;  the  spher- 
ical Earth  effects,  which  become  significant  for  zenith  angles 

(j)  larger  than  75°,  are  incorporated  into  the  theory  by  the  use  of 
the  generalized  Chapman  type  functions'  S((|))  (Refs.  4 and  17)  in 
place  of  the  secant  functions. 
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IV.  SINGLE  SCATTERING  THEORY  OF  THE  SOLAR  AUREOLE 


Figure  2 illustrates  the  geometry  of  the  calculation.  Shown 
is  an  acceptance  cone  dO,  originating  at  the  detector  and  a solid 
angle  dQ'  centered  at  the  elemental  scattering  volume  dV  at  alti- 
tude y(km)  . and  <p  are  the  zenith  angles  of  the  Sun  and 

the  narrow  view  cone  and  w is  the  dihedral  angle  between  the 
normals  to  the  Sun  zenith  and  view  cone  zenith  planes  intersecting 
at  dV.  The  scattering  angle  jp  is  then  given  by  the  relation 


cos  = cos  (f)  cos 


sin  (p  sin  (p 


s 


cos  OJ 


(1) 


The  element  dV  is  given  by 
dV  = R^  df^  S((|))  dy 


(2) 


where  the  generalized  Chapman  type  functions  (Refs.  4 and  17) 

S((j))  = sec  (p  (for  (p  £ 75°)  (3) 

The  optical  depth  defined  by 


Fig.  2.  TJie  geometry  of  the  sky  single  scattering  problem. 
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for  a ray  traversing  the  distance  from  the  Sun  to  the  air  mass 
element  dV  is  given  by 

T = E T (Xr  y)S.((|)  ) (j  = M,  A)  (5) 

J j J S 

and  from  the  air  mass  to  the  detector  by 

y)}Sj(4>)  (6) 

where  M denotes  air  molecules;  A,  the  aerosol  species;  X,  the 
wavelength;  and  3'  the  volume  scattering  coefficient  (VSC)  (km 
at  altitude  y for  the  jth  constituent.  In  this  paper,  all  quan- 
tities represented  by  x , 3 * , 3^ / F * , F, , P * , and  P,  are  functions 
of  m,  even  though  their  m dependence  is  not  indicated  in  their 
representation  form.  The  primes  denote  the  y-dependence  of  the 
quantities.  3^  defined  by  the  following  relations: 


3^(X,  y) 


y)7Tr^Q(X/  m)dr 


(7) 


where  Q(x,  m)  is  the  efficiency  factor  (Ref.  18)  , x = 2irr/X  is 
the  particle  size  parameter,  r^  and  r^  are  minimum  and  maximum 
values  of  r o.nd  m = m*'  - im”,  the  complex  refractive  index  of 
aerosols. 


y)  = 


(8a) 


where  the  VSC  for  molecules  is 


M 


8TT^(n^  - 1)  ^ (4  + 3d) 
N X'*  (4  - 3d) 


(8b) 


In  Eq.  8(b) , W is  the  number  of  molecules  cm  n,  the  refractive 

index  of  the  medium,  d = 4A/(1  - A);  and  A is  the  depolarization 

o 

of  scattered  light  at  a scattering  angle  of  90  for  a linearly 
polarized  incident  radiation  with  its  electric  vector  perpendicular 
to  the  scattering  plane.  For  unpolarized  incident  light,  A is 
replaced  by  ^ = 2A/(1  + A) . Then  the  volume  scattering  function 
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(VSF)  for  air  molecules  (j  = M)  is 

y)  = ^ ^ 

where  the  molecular  phase  function  is 

Pm('I')  = (1  + cos^jp) , sr  ^ (10) 

where  P^(y)  the  dimensionless  function  representing  the  alti- 
tude distribution  of  molecular  density.  The  VSF  for  aerosols 
(j  = A)  is 

X,  y)  = 3^(A,  y)P^Opr  A,  y)  (11) 


where  the  aerosol  volume  phase  function  is 

rr^ 


A,  y)  = — 

2k23'(X,  y)  J 


n (r,  y)  {ij^  (ip,  m,  x)  + 


(12) 


i (ip,  m,  x) } dr 
2 

and  i^  and  i^  are  the  Mie  intensity  functions  and  k = 2tt/X, 

The  sky  radiance  due  to  the  molecules  and  aerosols  in  the 
volume  element  dV  is  then  given  by 

dB((|),  (()  , 0),  X)  = H (X)  {S^((())F' ((|;,  X,  y)  + 
s o M M 


(13) 


-(T  + T ) 

S^((|))F^(i(;,  A,  y)}  e ^ ^ dy 


Integrating  over  all  such  elemental  volumes,  the  total  single 
scattered  sky  radiance  is 


B((|),  <f)  , w.  A)  = H (A) 
s o 


-E  . T .S  .(()))  r j-oo  -E.x.(A,  y)D. 

e ^ ^ ^ ■ Sjj(<f>)  j e ^ ^ 

0 


f“  (X,  y)D 

+ S^(^)J  e A,  y)dy  . 

0 


(14) 
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where 


D.  = S.  ((|)  ) - S.  ((|))  (j  = M,  A)  (15) 

3 3 s 3 

Before  discussing  the  inversion  problem,  a brief  description  of 
the  photographic  solar  aureole  measxirement  (PSAM)  technique  will 
be  in  good  order. 

V.  THE  PHOTOGRAPHIC  SOLAR  AUREOLE  MEASUREMENT  TECHNIQUE 

A photographic  technique  of  making  measurements  of  the  solar 
aureole  radiance  is  briefly  described  here.  Figure  3 schematically 
illustrates  the  equipment.  Photographs  of  the  Sun's  aureole  are 
taken  with  a small  format  camera  (35  mm  or  70  mm)  through  a wave- 
length filter  with  the  Sun  occulted  by  a neutral  density  (ND)  disc 
(ND  = 4)  held  coaxially  on  a stem  about  0.6  m to  1.3  m (2  to 
4 ft.)  in  front  of  the  lens.  The  ND  filter  attenuates  the  radi- 
ance of  the  direct  sunlight  by  a factor  of  10^,  so  that  the 
optical  density  of  the  Sun's  image  is  of  the  same  magnitude  as 
the  optical  densities  of  the  surrounding  aureole,  as  shown  in  a 
typical  photograph  in  Fig.  4a.  The  Sun's  image  not  only  enables 


ND  FILTER 


Fig,  3.  Schematic  illustration  of  the  arrangement  of  the 
solar  occulting  disc  and  the  camera  for  aureole  photography. 
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A typical  solar  aureole  35  mm-photograph  tal 
length  filter  X = 500  nm,  (h) . An  isodensity 
photograph.  (c)  . The  computed  solar  isophoti 


photograph . 


one  to  calibrate  the  entire  photograph  relative  to  its  radiance, 
but  also  enables  accurate  measurements  to  be  made  of  the  angular 
distances  from  the  Sun.  Photographs  are  taken  through  different 
wavelength  filters.  In  addition,  the  direct  solar  irradiance 
measurements  through  the  same  filters  are  made  with  a photometer 
to  determine  x(A,  0). 

A.  Solar  Aureole  Isophotes 

The  optical  density  of  the  whole  photograph  is  read  with  the 
help  of  a Joyce-Loebl  I sodensi tracer  which  gives  digital  data  out- 
put on  a magnetic  tape  and  at  the  same  time  provides  an  isodensity 
tracing,  such  as  shown  in  Fig.  4b.  Isodensities  are  lines  or 
curves  of  equal  optical  density  in  a photograph.  Isophotes  are 
then  generated  from  the  taped  data  output,  as  shown  in  Fig.  4c, 
where  an  economy-wise  reduction  has  been  made  in  the  number  of 
data  points. 

B.  Almucantar  Radiance 

The  photogramme try  of  the  solar  aureole  is  presented  in 
another  paper  submitted  for  publication,  where  it  is  shown  that 
the  almucantar  projects  on  the  film  as  a conic  (Fig.  5) . The 
shapes  of  the  conics  for  three  different  values  of  the  solar  zenith 
angle  cj)^  are  illustrated  in  Fig.  6.  Accordingly,  the  measured 
almucantar  radiance  as  a function  of  the  scattering  angle  is 
shown  in  Fig.  7.  The  peak  at  0^  corresponds  to  Sun-s  direct 
light  reduced  in  intensity  by  the  ND  disc. 


VI.  THE  INVERSION  OF  SOLAR  AUREOLE  MEASUREMENTS 

In  order  to  make  Eq.  (14)  simpler  and  amenable  to  inversion, 

measurement  should  be  restricted  to  zenith  angles  (j)  or  (|)  less 

o ^ 

than  75  . Then  the  spherical  Earth  correction  can  be  neglected 

and 
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ZENITH 


Fig.  5.  Almucantar  projection  on  the  film  as  a conic. 


ALMUCANTAR  PROJECTION  ON  FILM 


Fig, 
angles  (j)^ 


6.  The  shapes  of  the  conics  for  three  solar  zenith 
- 10^,  25^ , and  45^,  for  a lens  of  focal  length  55  mm. 
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ANGLE  1^,  Deg 

Fig.  7.  Plot  of  the  almucantar  radiance  as  a function  of  the 
scattering  angle  (solid  line) . Symbols  indicate  theoretically 
computed  radiance  for  different  size  distributions. 


= D = sec  (p  - sec  cb  (16) 

AM  ^ 

Equations  (14)  and  (15)  reduce  to 

0),  X)  = G J X,  y)  + X,  y)}  ^^°dy  (17) 

0 

where 


G = H (X)  sec 
o 


*-t(X,  0)  sec  (j) 


With  Eq.  (17)  , it  is  possible  to  obtain  the  information  about 
the  size  altitude  distribution  of  aerosols  frojJBu  the  multispectral 
measurements  of  (a)  sky  radiance  B(X)  as  a function  of  (p^r  (p,  and 
03  and  (b)  the  total  optical  depth  t(X,  0). 

The  altitude  distribution  of  molecular  density  Pj^(y)  can  be 
obtained  from  tables  or  from  radiosonde  data.  The  functions 

y)  and  y)  f both  of  which  depend  on  the  altitude 
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size  distribution  nCr,  y)  / are  the  unJcnowns  in  Eq.  (16). 

Therefore,  in  order  to  make  Eq.  (16)  tractable  to  inversion,  an 
assijmption  has  to  be  invoked  about  the  separability  of  the 
altitude-size  distribution  function  n(r,  y)  in  the  form 

ri(r,  y)  = n(r)p^(y)  (19) 

where  the  dimensionless  quantity  P^(y)  is  defined  by 

P^(y)  = N(0,  y)/N(0,  0)  (20) 

— 3 

N(0,  y)  being  the  number  of  particles  cm  at  altitude  y(km) . 


This  assiomption  implies  that  the  form  of  the  particle  size 
distribution  n(r)  does  not  itself  change  with  altitude  y,  but  only 
the  number  density  or  concentration  varies.  This  is  a reasonable 
assumption  in  view  of  Junge's  experimental  observations  that  up  to 
about  3 km,  the  aerosol  size  distribution  n(r)  remains  nearly  con- 
stant. Since  most  of  the  aerosols  are  concentrated  below  3 km, 
and  according  to  Elterman's  data,  the  aerosol  density  falls  off 
nearly  two  to  three  orders  of  magnitude  at  an  altitude  of  about 
5 km,  the  error  introduced  due  to  extension  of  the  assumption  of 
constancy  of  n(r)  to  regions  above  3 km  is  small.  Then,  functions 

3'/  F*f  and  P'  reduce  to 
A A A 


y)  = X)p^(y) 

and 

y)  = A) 

where 


3a(^) 


irr^Q(x,  in)n(r)dr 


(21) 

(22a) 

(22b) 

(23a) 


and 
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(23b) 


X) 


+ i^)n(r)dr 


Thus, 


(23c) 


In  case  two  aerosol  layers  having  different  size  distributions  are 
present  in  the  atmosphere/  it  is  then  convenient  to  choose  a two- 
term  model  for  n(r/  y) , such  as 

n(r/  y)  = 


which  is  separable  in  r and  y in  each  of  its  terms.  This  case  is 
not  discussed  here,  but  will  be  treated  in  a subsequent  paper. 

In  Eq.  (17) / both  the  molecular  and  aerosol  scattering  con- 
tributions depend  on  the  unknown  quantity  t (y) . Let  us  define 

J\ 

the  "effective  transmission"  functions  for  air  molecules  and 
aerosols  as 


’m'W 


w„(0) 


dy  p^(y) 


-Z .T , (A,  y)D 
* 3 3 


(25) 


and 


w,  (0) 


-Z.t.(A,  y)D 
dy  (y)  e ^ ^ 


(26) 


0 


where  the  integrated  thickness  is  given  by 

(j  = M,  A) 


Wj  (y)  = 


Pj  (y')dy’ 


(27) 


By  using  Eqs.  (25)  and  (26) , Eq.  (17)  can  be  written  as 


B((|),  <j)^,  0),  A)  = G{Pj^(<|),  A)Tj^(A,  O)T^(D)  + P^(i(^,  \)t^(A,  O)T^(D)}  (28) 


The  y-dependence  on  the  right-hand  side  of  Eq-  (28)  is  now  confined 

to  the  two  factors,  T,  and  T^,  representing  effective  transmission 

M A' 
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factors.  It  can  easily  be  shown  that  T is  related  to  as  fol- 

M A 

lows: 


T„(X,  0) 


— y)D 

e 


1 


-T  (X,  0)D'\ 
A 

D 


1 - e 


T^(X,  0) 

Tj^(X,  0) 


T^(d) 


-Ta(X,  y)D 

(29) 


Granting  we  know  H^(X),  t(X,  0)  can  be  determined  by  direct  radi- 
ance of  the  half-degree  Sun  cone  given  by 

-t(X,  0)  sec  <t> 

H (X)e  ^ -T  (X,  0)sec  (j) 

B = = 1.67  X 10*+  H (A)e  ° ® (30) 

S dV  O 

s 

T (X,  0)  can  be  computed  from  radiosonde  data  for  the  observation 
M 

site  so  that 

T^(X,  0)  = T(X,  0)  - T (X,  0)  (31) 

A M 

Many  techniques  for  obtaining  the  aerosol  characteristics  suggest 
themselves  in  the  light  of  this  analysis.  One  of  the  simplest  and 
the  most  elegant  is  the  method  based  on  the  almucantar  radiance 
measurements/  as  explained  in  the  following  section. 


VII.  ALMUCANTAR  AS  AN  INDICATOR  OF  n(r) 

The  difficulty  of  obtaining  information  about  aerosol  charac- 
teristics from  Eq.  (28)  lies  in  that  the  dependence  of  B on  n(r) , 
m and  T^(y)  is  not  separable.  One  simple  method  of  handling  this 
problem  is  to  make  radiance  measurements  in  the  almucantar 

((j)  = 6)  . 
s 
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For  ^ ^ / 1^7.  = T"  ==  1 so  that  Eq.  (28)  reduces  to 

S A M. 

B(<j)  = (})  , CO,  X)  = G{p  (1|),  X)t  (X,  0)  + P-(i|c,  X)t  (X,  0) 

S MM  A A 

which  is  independent  of  y.  then  be  obtained  from 

almucantar  radiance  measurements  by  rewriting  Eq.  (32)  as 


X)  = 


T^(X,  0) 


B(<))  = (j)^,  (0,  X) 


(32) 


(33) 


Figure  8 shows  a plot  of  the  experimental  phase  function  curve 
normalized  to  unity  at  3^. 

In  the  following  is  described  an  algorithm  for  obtaining 
n(r)  from  A) . If  we  assume  for  m an  average  value  of  say 

1.55  + i(O.O),  it  should  then  be  possible  to  obtain  n(r)  from 
Eq.  (33)  by  either  a numerical  inversion  scheme  or  a model-fitting 
approach.  In  either  case,  theoretical  values  of  A)  obtained 

from  Eq.  (23c)  are  compared  with  experimental  values  of  A) 


Fig.  5.  Experimental  phase  function  curve  obtained  from 
almucantar  radiance  curve  shown  in  Fig,  7.  Dashed  curve  corre- 
sponds to  right-hand  side  of  the  Sun;  solid  curve  to  the  left- 
hand  side. 
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obtained  from  Eq.  (33) . In  the  latter  approach,  a catalog  of  phase 
functions  is  generated  by  use  of  an  analytic  model  for  n(r) , such 
as 


n(r) 


Q 


(p/r)'"  ■ ^ 
e 

P2<e  - 1)  + 1 


2 


1 


V 

r 


(34) 


where  p^,  p^f  P3,  and  v are  adjustable  constants  and  the  efficiency 
factor,  Q = 2,  for  large  r.  Typical  size  distribution  curves  for 
different  values  of  the  parameter  v are  shown  in  Fig.  9.  Any 
other  realistic  analytic  model  of  n(r)  will  do  just  as  well. 

A set  of  phase  function  curves  corresponding  to  values  of  v 
that  range  from  4.0  to  5.2  are  shown  in  Fig.  10.  By  comparing 
the  experimental  phase  function  curve  in  Fig.  8 with  one  of  the 
curves  in  the  catalog,  one  obtains  a reasonably  good  estimate  of 
the  n(r) . If,  however,  one  wants  to  go  a step  further  and  obtain 
the  n(r)  that  gives  the  best  fit  to  the  experimental  phase  function 
curve,  a least  squares  computer  code  is  used.  But  the  cost  of 
such  a computation  is  often  prohibitive,  especially  when  Mie 
theory  is  used,  encouraging  one  to  stop  short  and  settle  for  the 
n(r)  obtained  with  a visual  fit  to  one  of  the  computed  curves  in 
the  catalog . 

VIII.  SOLAlR  aureole  ISOPHOTES  as  indicators  of  n(r)  and  P^(y) 

An  extensive  parametric  computation  of  the  solar  aureole 
isophotes  as  functions  of  n(r)  and  P^(y)  to  some  extent,  of 

m,  has  been  carried  out  by  using  Eqs.  (28)  and  (34)  and  the  aerosol 
number  density  profile  N(0,  y) , such  as  the  one  obtained  from  lidar 
measurements  and  shown  in  Fig.  11.  Everything  else  being  the  same, 
circumsolar  isophotes  corresponding  to  three  values  of  the 
parameter  v of  the  size  distribution  n(r)  (Fig.  9)  are  shown  in 
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Fig.  9-  Typical  size  distribution  curves  for  parameter 
j = 4 .Of  4.4 , and  5.2. 


Fig.  10. 


A set  of  phase  function  curves  corresponding  to 


the  size  distribution  curves  shown  in  Fig.  9 for  v values  in  the 
range  4.0  to  5.2. 


Fig.  11.  Altitude  distribution  of  particle  number  density 
(cm~^)  obtained  from  lidar  observations. 

Figs.  12a,  12b,  and  12c.  It  is  easy  to  see  from  Figs.  9 and  12 
that  by  slightly  increasing  the  number  of  smaller  particles  and 
decreasing  the  number  of  larger  particles  as  v becomes  larger, 
the  shape  of  the  isophotes  undergoes  a dramatic  change . As  the 
value  of  V increases  upwards  from  a value  of  4.0,  the  isophote 
pattern  attains  a shape  increasingly  similar  to  that  of  the 
experimentally  obtained  isophotes  (Fig.  4c) , until  for  a value  of 
V = 5.0  the  computer  generated  pattern  (Fig.  12c)  best  resembles 
the  latter.  Thus,  Fig.  12-  illustrates  the  fine  sensitivity  of  the 
patterns  of  the  circvimsolar  isophotes  to  the  size  distribution 
n(r).  In  contrast,  in  the  case  of  the  lidar  backscattering  ratio 
profile,  defined  by 

F'(7t,  X,  y) 

Rj.  ('IT,  X,  y)  = 1 + (35) 

F' (ir,  X,  y) 

M 
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X , rnm 

(a) 


X , mm 


(c) 


Fig.  12.  Computer  generated  SS  circumsolar  isophotes  for 
different  values  of  the  size  distribution  parameter  vj  (a)  v = 4.0, 
(b)  V = 4.4,  and  (c)  v = 5.0. 
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The  changes  in  corresponding  to  the  changes  in  the  value  of  v 

from  4.0  to  5.2  occur  in  the  peak  values  of  R at  16.5  km  and 

A 

22.5  km  (Fig.  13)  which  is  a feature  not  too  sensitive  to  n(r) 
since  these  differences  can  be  easily  removed  by  adjusting  the 
scaling  parameter  in  n(r) . 


Fig.  13.  Lldar  profiles  of  the  backs cattering  ratio  for 
V =:  4.0,  V = 4.6,  and  v = 5.2  showing  two  layers  of  volcanic  dust 
with  peak  concentrations  at  altitudes  of  16.5  km  and  22.5  km  over 
Hampton , Virginia . 


Attempts  are  underway  to  optimize  computer  programs  that  will 

obtain  the  best  fit  to  the  isophote  pattern  or  numerically  invert 
\ 

the  isophote  data  by  utilizing,  to  their  advantage,  the  fact  that 

radiance  along  each  isophote  remains  constant.  In  this  regard, 

it  is  important  to  keep  in  mind  that  even  though  isophotes  depend 

on  both  n(r)  and  P^(y)  # within  the  aureole  region  the  isophote 

shape  is  more  sensitively  dependent  on  n(r)  than  on  P^(y)  • The 

sensitivity  of  the  isophotes  to  p.{y)  increases  for  larger  angular 

A 

distances  from  the  S\an. 
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IX-  NUMERICAL  INVERSION  METHODS 

Whereas  in  the  modeling  approach  one  starts  with  a model 
judicially  guessed  at  by  experience,  in  the  numerical  inversion 
approach  one  starts  with  an  initial  estimate  of  parameters.  A 
typical  iterative  scheme,  such  as  the  one  described  by  Malchow 
and  Whitney  (Ref-  19),  is  illustrated  in  Fig.  14. 


Fig.  14.  Schematic  representation  of  iterative  inversion  of 
measurements . 

Simulations  of  the  sky  radiance  isophote  shape  or  the  phase  func- 
tion are  performed  by  using  the  radiative  transfer  equation  (such 
as  Eq.  (28)  for  SS)  or  Eq.  (23c)r  respectively,  along  with  the 
initial  set  of  parameters.  Comparisons  are  made  with  the  appro- 
priate measurements.  If  the  discrepancy  is  greater  than  a minimimi 
prescribed  value,  then  a parameter  updating  algorithm  is  used  to 
obtain  a new  estimate  of  parameters.  However,  when  the  convergence 
criteria  is  satisfied,  the  final  estimates  of  parameter  are  assumed 
to  be  accurate.  ' 

In  such  an  iterative  scheme,  the  simulation  relation  (e.g., 
the  radiative  transfer  Eq.  (28))  is  called  several  times  during 
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each  iteration.  And  here  lies  the  problem,  namely,  the  prohibitive 
costs  of  computation.  One  way  to  handle  the  problem  is  to  use 
simple  approximations  for  and  in  terms  of  the  parameters  in 
n(r) , m,  and  N(0,  y) . Exact  Mie  theory  computations  of  scattered 
radiance  when  used  in  an  inversion  scheme  were  found  to  be  pro- 
hibitively expensive.  Thus,  it  is  imperative  that  in  developing 
a radiative  transfer  code  with  the  aim  of  using  it  in  an  inversion 
scheme,  it  must  be  as  computationally  fast  as  possible  in  order 
to  be  of  practical  use  in  numerical  retrievals  of  aerosol  charac- 
teristics. Work  is  in  progress  on  the  development  and  opti- 
mization of  such  inversion  schemes  and  radiative  transfer  programs 
based  on  the  exact  Mie  theory  results. 

X . CONCLUDING  REMARKS 

Under  the  present  state  of  the  art  in  inverting  sky  scat- 
tered radiance,  it  seems  that  for  the  present  the  modeling  approach 
applied  to  the  solar  aureole  measurements  gives  reasonably  good 
estimates  about  the  aerosol  characteristics,  particularly  about  * 
the  size  distribution  n(r) . In  addition,  such  a parametric 
modeling  approach  provides  not  only  a clear  physical  insight  into 
the  problem,  but  also  understanding  of  the  sensitivity  of  the 
individual  aerosol  parameters  that  are  sought.  On  the  other  hand, 
catalog  or  modeling  methods  can  become  very  unwieldy  when  more 
than  one  aerosol  parameter  is  sought.  It  is  precisely  in  such 
cases  that  when  several  parameters  are  simultaneously  sought,  the 
numerical  inversion  schemes  have  an  added  advantage  over  the 
modeling  approach. 

In  spite  of  the  tremendous  upsurge  in  research  activity  in 
this  field  in  recent  years,  much  of  our  present  knowledge  of 
aerosol  characteristics  is  either  on  a localized  scale  or  is 
spatially  and/or  temporally  averaged.  To  acquire  a large  scale 
or  global  view  of  aerosol  characteristics,  their  monitoring  must 
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be  carried  from  aboard  space  platforms.  Greater  endeavor  needs 
to  be  made  in  the  area  of  aerosol  remote  sensing  techniques  and, 
particularly,  on  the  inversion  aspects  of  the  problem. 
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G 
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j 

k 

in 

m* 

m" 

n(r) 

n 
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W 
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r 

s 
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sky  radiance 

direct  solar  radiance 

almucantar  radiance 

= 4A/(1  - A)  , in  Eq.  (8b) 

= sec  (f)  - sec  ^ 

s 

= S ((f)  ) - S ((f)),  in  Eq.  (15) 

J s j 

voliome  scattering  functions.  Prime  denotes  y- 
dependence 

factor  defined  in  Eq.  (18) 
unattenuated  solar  irradiance 
Mie  intensity  functions 

intensity  of  solar  disk  normalized  to  unity  at  its 
center 

= M for  molecules  and  = A for  aerosols 
= 27T/A 

complex  refractive  index  of  aerosols;  m = m*  - im" 
real  part  of  m 
imaginary  part  of  m 

-3  -1 

differential  size  distribution,  cm  ym 

molecular  refractive  index 

nimber  density  of  aerosols  at  y(km)^  cm 

-3 

number  density  of  molecules,  cm. 

adjustable  constants  in  Eq.  (34) 

phase  functions . Prime  denotes  y-dependence 

efficiency  factor 

radius , ym 

solar  radius 

distance  of  scattering  volume  to  the  detector 
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lidar  backscattering  ratio 

"effective  transmission"  function,  defined  in  Eqs- 
(25)  and  (26) 

scattering  volume  of  atmosphere 
integrated  thickness,  defined  in  Eq.  (27) 
distance  along  3C-direction  on  photograph 
altitude,  km 

distance  along  y~direction  on  photograph 
volume  extinction  coefficients  for  the  jth  con- 
stituent • Prime  denotes  y-dependence 
two  •'dimensional  function  of  r and  y 
wavelength 

dimensionless  factor  representing  the  altitude 
dependence  of  jth  constituent, 
total  optical  depth  for  all  constituents 
optical  depth  for  the  jth  constituent 
optical  depths  from  top  of  atmosphere  to  V and 
V to  detector,  respectively 
adjustable  constant  in  Eq.  (34) 

zenith  angles  of  the  Sun  and  detector  view  cone, 
respectively 
scattering  angle 

dihedral  angle  between  the  normals  to  the  Sun  zenith 
and  view  cone  zenith  planes  intersecting  at  the 
scattering  volume 

solid  angles  of  cones  centered  at  the  detector  and 
the  scattering  volume 
half^degree  Sun  cone 
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DISCUSSION 


Fymat:  You  have  mentioned  that  you  are  using  the  solar  aureole, 

but  it. seems  to  me  that  you  are  also  using  lidar  and  multi-wave- 
length extinction.  I am,  therefore,  a little  bit  lost  on  what 
measurements  you  are  employing  and  what  parameters  you  are 
attempting  to  recover. 

Deepak:  Normally,  you  use  a radiometer  to  perform  an  extinction 

measurement  of  the  direct  solar  radiation.  What  I am  suggesting 
is  that  the  extinction  data  are  not  sufficient  to  obtain  the  size 
distribution  accurately.  But,  if  in  addition  one  has  measurements 
of  the  angular  distribution  of  scattered  radiance,  one  gets  a much 
better  idea  of  the  size  distribution.  That  is  one  point. 

Monostatic  lidar  measurements,  on  the  other  hand,  give  you  a good 
vertical  resolution  of  aerosol  backscattering  coefficient  from 
which  information  about  the  altitude  distribution  of  aerosol  con- 
centration can  be  obtained.  Vlhat  I have  shown  is  that  whereas 
the  lidar  backscattering  ratio  data  one  can  fit  with  any  of  the 
size  distributions^ obtained  with  v = 4.0  to  5.2,  the  solar  aureole 
isophote  data  one  could  fit  with  only  one  of  those  size  distri- 
butions. Only  one  of  them  reproduced  all  these  isophote  curves 
including  the  bumps  seen  in  the  curves. 

Fymat:  I have  two  very  brief  questions.  Are  you  doing  all  the 

work  assuming  the  Junge  distribution?  And,  are  you  fitting  for 
one  Junge  parameter? 

Deepak:  Let  me  show  the  size  distribution  function  that  I used, 

but  did  not  have  the  time  to  show  earlier. ^ It  is  a function  with 
four  constants  and  those  curves  of  size  distribution  and  isophote s 
that  you  saw  were  obtained  from  this  function  by  varying  the 
parameter  v.  For  large  particles,  the  size  distribution  behaves  as 
a Junge  distribution.  We  have  one  scaling  parameter  and  three 
adjustable  constants  here.  The  aim  is  to  obtain  the  set  of 
parameters  which  gives  the  best  fit  to  the  experimental  values  of 
the  multispectral  phase  function,  the  isophotes,  the  extinction 
coefficient,  as  well  as  the  lidar  backscattering  ratio.  So  one 
tries  to  best  fit  as  many  different  measurements  obtained  by 
different  methods  by  adjusting  the  parameters  in  order  to  get  at 
the  so-called  "unique"  size  distribution . 

(In  response  to  Dr.  Delrmendjian^s  show  of  hand):  Well,  I must  also 

mention  that  Professor  Deirmendjian  suggested  a similar  approach  to 
aureole  measurements  a long  time  ago  and  tried  to  show  the  same 
thing . 


1 


See  Eq. 


(34)  in  text. 
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Deirmendjian:  Well,  I have  no  coinment  then!  In  1956,  that  is 

what  I wanted  to  show  and  I did  take  Junge-type  distribution  with 
different  slopes  and  I demonstrated  that  the  aureole  would  change. 

Irvine:  You  assumed  an  index  of  refraction? 

Deepak:  Yes,  I did. 

Irvine:  In  some  problems  that  could  make  a big  difference,  but 

I guess  here,  since  you  are  mostly  concerned  with  the  diffraction 
peak,  it  may  not  be  so  sensitive. 

Deepak:  Yes,  that  is  indeed  another  advantage  of  the  forward 

scattering,  that  it  is  not  as  sensitive  to  that  effect.  However, 
it  does  have  some  sensitivity.  I have  done  a parametric  study 
with  about  15  different  indices  of  refraction — varying  the  values 
of  both  the  real  part  as  well  as  the  imaginary  part.  Only  one 
index  of  refraction,  with  a value  of  1.55-i{0),  gave  a good  fit 
to  the  data.  When  I used  a small  value  for  the  imaginary  part, 
the  curves  tended  to  flatten  out.  So,  there  is  that  small  ball- 
park of  error  in  the  index  of  refraction  value,  depending  on  the 
coarseness  of  grid  of  values  chosen. 

Irvine:  It  might  have  been  useful  to  have  some  chemists  at  the 

conference  who  consider  these  sorts  of  inversion  problems  using 
laboratory  data;  that  is,  deducing  hydrosol  properties  from  scat- 
tered intensities  and  polarizations.  You  also  assume  single 
scattering? 

Deepak:  As  a first  approximation,  yes.  That  is  why  this  method 

is  valid  for  relatively  clear  days.  But  I think  perhaps  Jerry 
Twitty  and  definitely  Dr.  Green  have  done  some  work  with  multiple 
scattering  in  the  solar  aureole.  Dr.  Green  might  elucidate 
further  upon  this  point  in  his  talk  later. 

Irvine:  It  would  be  relatively  easy  to  take  these  models  you 

deduce  and  see  how  much  second-order  scattering  there  is  to  see  how 
good  an  approximation  single  scattering  is? 

Deepak:  Yes,  I did  some  calculations  on  that.  Second  order  con- 

tribution was  not  that  much.  It  was  within  5 to  6%  of  the  single 
scattering  contribution. 

Twitty:  With  regard  to  multiple  scattering,  that  is  obviously  a 

function  of  how  much  aerosol  is  actually  in  the  atmosphere.  So 
to  estimate  it  you  have  got  to  say  something  about  your  optical 
depth.  What  Adarsh  has  done  is  essentially  independent  of.  that 
because  he  has  assumed  single  scattering.  So  up  to  whatever  point 
multiple  scattering  becomes  important,  his  results  hold. 
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Deepak:  Up  to  about  optical  depths  of  0.5  or  so,  the  method  is 

good. 

Twitty:  Now  in  the  work  I did  for  my  doctoral  thesis,  radiance 

measurements  along  the  solar  almucantar  in  the  same  geometry  that 
Adarsh  showed,  I did  not  actually  include  any  multiple  scattering 
but  it  can  be  shown  to  what  extent  that  was  important  for  the 
specific  data  that  I used  in  the  analysis,  which  had  quite  low 
optical  depths,  about  0.1  for  the  aerosol  part. 

Herman:  0.01  or  0.1? 

Twitty:  0.1. 

Herman:  What  did  you  find  for  results  on  multiple  scattering? 

o o 

Twitty:  I used  our  data  from  about  3 to  about  25  , which  is  very 
similar  to  what  Adarsh  shows,  and  the  multiple  scattering  is  about 
5%  of  the  radiance  at  25  . So,  I think  it  is  similar  to  other 
people's  results,  including  yours. 

P.  Russell:  I wonder  about  the  sejnsitivity  of  this  method  to  the 

stratospheric  aerosol.  In  particular,  what  happens  to  your  sim- 
ulated isophotes  if  you  just  eliminate  the  stratospheric  aerosol 
from  the  simulation? 

Deepak:  I have  done  a number  of  numerical  parametric  studies  on 

that,  but  did  not  have  time  to  show  the  results.  We  added  aerosol 
layers,  first  two  and  then  three  layers,  and  moved  them  up  and 
down  in  altitude.  The  shift  in  the  isophote  pattern  was  seen  at 
about  15  away  from  , the  Sun.  In  the  near  forward  direction,  no 
noticeable  shift  in  the  shape  of  the  isophotes  was  detected.  But 
for  larger  angles  there  was  definitely  a shift.  One  could  show 
that  introduction  of  upper  aerosol  layers  does  affect  the  shape 
of  isophotes  at  larger  angular  distances  from  the  Sun.  The  iso- 
phote curves  changed  a little  bit  at  angles  beyond  10  or  so, 
even  though  the  aerosol  layer  introduced  into  the  model  was  opti- 
cally very  thin. 

P.  Russell:  Does  that  mean  then  that  the  size  distribution  param- 

eter that  you  extract  from  this  method  is  sort ' of  a mean  parameter 
for  the  tropospheric  and  stratospheric  aerosols? 

Deepak:  Exactly,  all  results  are  averaged  over  the  entire  atmos- 

phere— for  example,  we  obtain  average  refractive  index  and  average 
size  distribution — because  we  are  using  the  assumption  of  separa- 
bility of  the  altitude  size  distribution  here.  ' 

P.  Russell:  Would  the  fact  that  the  stratospheric  aerosols  seem 

to  have  their  effects  in  a different  angular  region  help  you  to 
separate  the  stratospheric  and  tropospheric? 
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Deepak:  Well/  one  of  the  ways  you  can  do  this  is  to  go  up  in  a 

plane  and  take  aureole  photographs,  but  it  is  very  expensive. 

And  that  is  one  of  the  reasons  for  trying  to  improve  the  tech- 
niques of  the  inversion  of  this  ground-based  data  so  that  strato- 
spheric aerosol  effects  can  be  separated.  For  this  purpose,  work 
is  in  progress  on  optimizing  a few  different  inversion  schemes. 

I hope  to  use  some  of  the  techniques  that  have  been  developed  by 
Harvey  Malchow  and  Cindy  Whitney.  The  results  are  not  quite  com- 
plete yet. 

P.  Russell:  It  looks  very  impressive,  very  low  cost. 

van  de  Hulst:  On  the  matter  of  the  influence  of  multiple  scat- 

tering, it  is  possible  to  make  a very  easy  rough  estimate  of  when 
it  becomes  important.  If  I stand  looking  at  the  Sun,  there  is  a 
drop  of  a factor  of  1000  from  the  solar  disk  to  where  the  aureole 
starts.  If  I integrate  over  a solid  angle  of  say  10  times  the 
radius  of  the  Sun,  which  is  a solid  angle  of  100  times  the  Sun, 
then  the  integrated  radiation  coming  from  the  aureole  is  still  a 
factor  10  below  that  of  the  Sun.  That  means  that  multiple  scat- 
tering must  be  somewhere  in  the  order  of  10%.  And  this  holds  for 
me  standing  on  the  ground.  Another  factor  of  two  is  lost  for  the 
average  aerosol  particle  in  the  layer  somewhere  up  in  the  atmos- 
phere. By  this  estimate  in  the  normal  clear  air  situation,  the 
multiple  scattering  is  rarely  important  beyond  some  5%. 

Reagan:  It  was  not  clear  whether  you  were  trying  to  rationalize 

and  obtain  agreement  between  the  aureole  data  and  the  extinction 
and  the  lidar  backscatter  measurements.  I might  add,  amplifying 
on  the  index  of  "ref raction  aspect,  that  the  relationship  between 
the  extinction  coefficient  and  the  backscatter  coefficient  does 
indeed  change  rather  markedly  as  you  change  either  the  real  or 
imaginary  component.  If  you  are  trying  simultaneously  to  make 
those  agree  with  the  aureole  data,  you  would  indeed  have  had  some 
sensitivity  to  index.  Did  you  go  back  and  check  on  this  and 
iterate  in  that  sense? 

Deepak:  The  same  computer  code  calculated  all  the  things  I have 

shown.  I ran  the  programs  for  dif f erent  m * and  m“  values.  The 
shapes  of  the  lidar  backscattering  ratio  curves  came  out  to  be 
nearly  identical  except  for  the  differences  in  magnitude  at  peaks 
of  aerosol  concentration,  which  differences  could  be  removed  by 
adjusting  the  scaling  parameter  in  n(r) . From  this,  I could  not 
clearly  distinguish  between  the  various  size  distributions  for  the 
same  refractive  index.  However,  I feel  that  further  work  needs  to 
be  done  on  this  aspect. 
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ANALYTIC  MODEL  APPROACH  TO  THE  INVERSION 


OF  SCATTERING  DATA 


Alex  E.  S-  Green  and  Kenneth  F.  Klenk 
University  of  Florida 


We  apply  an  analytic  model  approach  which  has  been 
developed  in  nuclear  studies  to  several  simple  atmospheric 
inversion  problems.  We  illustrate  by  past  work  on  the 
solar  aureole  that  this  method  gives  a sharp  determination 
of  aerosol  size  distribution  parameters.  We  show  that  this 
analytic  approach,  together  with  ground  level  point  sampling 
data  measurements,  may  be  used  to  infer  information  on  the 
tropospheric  ozone  profile . 


I.  ATMOSPHERIC  AND  NUCLEAR  OPTICS 

Many  of  us  who  are  now  involved  in  atmospheric  inversion 
problems  were  previously  involved  in  analogous  problems  in  other 
disciplines.  As  is  natural,  we  try  to  bring  to  bear  the  experience, 
sense  of  aesthetics,  or  prejudices,  if  you  will,  which  we  have 
acquired  in  these  other  fields-  The  beauty  of  this  conference  as 
it  is  developing  following  some  of  the  earlier  papers  is  the  sense 
of  open-mindedness  which  has  emerged.  It  is  as  if  this  conference 
has  said,  "Let  a thousand  flowers  blossom." 

In  my  own  (Green)  case,  my  main  prior  involvement  with  inver- 
sion problems  has  been  in  connection  with  two  nuclear  physics  endeav- 
ors based  largely  upon  scattering  data — (1)  inferring  the  nature  of 
the  fundamental  interaction  between  neutrons  and  protons,  and 
(2)  inferring  the  detailed  nature  of  the  nuclear  potential  mani- 
fest in  the  shell  and  optical  models  of  the  nucleus.  Let  me  use 
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the  first  problem  as  an  illustration  of  how  understanding  is 
advanced  in  scattering  inversion  problems. 

When  the  neutron  was  discovered  in  1932,  the  fundamental  prob- 
lem in  nuclear  physics  became  that  of  inferring  the  basic  force 
between  neutrons  and  protons.  The  approach  followed  has  been  to 
perform  scattering  experiments,  i.e.,  fire  neutrons  or  protons  on 
hydrogen  targets,  and  examine  the  emerging  angular  distributions 
(phase  functions)  and  polarizations  at  various  energies  (wave- 
lengths) of  the  outgoing  particles.  The  hope  was  to  be  able  to 
test  various  proposed  two-body  potentials  which  when  inserted  into 
the  Schrddinger  equation  or  Dirac  equation  might  account  for  these 
data  within  statistical  error.  This  was  the  main  line  of  approach 
in  nuclear  physics  until  the  early  1960s  when  the  only  phenomeno- 
logical models  which  could  fit  the  0 to  400  MeV  array  of  scattering 
data  and  auxiliary  data  such  as  the.  properties  of  the  bound  two- 
body  system  (the  deuteron)  were  exceedingly  complex,  requiring 
as  many  as  40  adjustable  parameters  in  their  description. 

A breakthrough  came  in  the  mid-1960s  when  the  discovery  of  the 
0),  p and  n mesons  by  particle  physicists  led  to  the  revival  of 
meson  theory  of  nuclear  forces  initiated  by  Yukawa  in  1935.  With 
the  additional  physical  constraints  of  meson  theory,  it  suddenly 
became  possible  to  fit  the  scattering  data  with  one  boson  exchange 
model  requiring  only  five  to  ten  adjustable  parameters,  rather 
than  the  40  parameters  of  purely  phenomenological  models.  Although 
the  final  story  is  not  yet  told,  the  nuclear  physics  community, 
since  1967  (Refs.  1 and  2),  has  felt  a great  aesthetic  sense  of 
relief  that  the  fundamental  law  of  nuclear  physics  is  not  as  mon- 
strous as  it  had  appeared  to  be  in  the  early  sixties . 

Thus,  as  some  of  the  earlier  speakers  have  already  suggested, 
it  is  the  additional  physics,  physical  judgment, and  physical  infor- 
mation which  one  brings  to  bear  with  the  scattering  data  which  will 
often  determine  the  success  and  utility  of  an  inversion  scheme. 
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Studies  leading  to  the  nuclear  shell  and  optical  models 
(Ref.  3)  are  even  more  analogous  to  the  atmospheric  inversion  prob- 
lem. For  over  40  years,  experiments  have  been  performed  in  which 
..various  nuclear  particles  accelerated  from  1 MeV  to  100  GeV  energy 
range  are  scattered  from  various  nuclear  targets.  Many  lead  to 
optical-type  angular  distributions  (phase  functions) , polarizations, 
scattering,  and  absorption  cross  sections.  Many  of  these  data 
patterns  can  be  accounted  for  by  assuming  a complex  energy  depen- 
dent nuclectr  potential  (complex  wavelength  dependent  index  of 
refraction) . Even  the  terminology  of  this  subject,  such  as  "the 
cloudy  crystal  ball  model,"  reflects  the  light  scattering  analogy. 
Now  nuclear  opticians,  like  atmospheric  opticians,  divide  up  into 
a school  concerned  with  average  gross  properties  and  a school  con- 
cerned with  statistical  fluctuations.  Both  groups  have  greatly 
enriched  the  subject,  although,  as  in  the  light  scattering  case, 
the  communications  between  the  schools  has  not  always  been  the 
best. 

My  own  specialized  pursuits  of  atmospheric  optics  (apart 
from  a stint  in  World  War  II)  began  in  1959  just  after  an  intensive 
involvement  with  nuclear  optics  (Ref.  4) . In  these  pursuits,  I 
have  mostly  used  the  gross  structure-nuclear  optical  modelers 
approach.  The  style  here  has  been  to  use  analytic  models  whose 
parameters  are  determined  by  nonlinear  least  square  adjustment  to 
experimental  data.  Then  we  look  at  the  systematic s of  the  param- 
eters with  the  ultimate  objective  of  relating  them  to  more  funda- 
mental physical  parameters,  e.g. , those  in  the  basic  nuclear  force. 

I would  like  now  to  illustrate  this  nuclear  optical  approach  with 
a few  simple-minded  attacks  on  some  atmospheric  optics  problems. 


II.  AUREOLE  STUDIES 

Deirmendjian  and  Sekera  (Refs.  5 and  6)  very  early  recognized 
the  importance  of  Mie  particles  in  the  theory  of  the  solar  aureole. 
Their  work  was  motivated  by  an  attempt  to  account  for  some  reported 


299 


anomalously  high  transmission  of  the  ultraviolet  part  of  direct 
sunlight.  It  is  interesting  to  note  that  D.  S,  Saxon,  who  col- 
laborated with  Deirmendjian  and  Sekera  on  scattering  from  dielectric 
spheres  (Ref.  7)  during  the  same  time  frame,  played  an  important 
role  in  the  development  of  the  nuclear  optical  model. 

Onr  work,  on  determining  aerosol  size  distributions  from  solar 
aureole  intensities,  has  used  a type  of  atmospheric  modeling  and 
single  scattering  theory  which  we  first  used  in  a satellite  ozone 
sounding  analysis  (Ref.  8) . That  the  skylight  in  the  neighborhood 
of  the  sun  can  be  used  to  good  advantage  may  be  surmised  from  the 
optical  theorem  of  nuclear  and  atomic  scattering  theory  and 
bistatic  radar  analysis  (Ref.  9) . These  works  indicate  that 
whereas  backscatter  cross  sections  (180^  scatter)  vary  in  a 
complex  manner  with  particle  shape  and  index  of  refraction,  for- 
ward scatter  cross  sections  are  primarily  detemined  by  the  voliome 
of  the  particle.  This  property  was  first  utilized  in  the  doctoral 
theses  of  Adarsh  Deepak  (Ref.  10)  and  Barton  J.  Lipofsky  (Ref.  11) 
which,  among  other  things,  involved  photographic  and  photoelectric 
studies  of  the  solar  aureole  in  the  visible  region  (Ref.  12) . More 
recently,  these  studies  have  been  extended  into  the  ultraviolet 
(Refs.  13  and  14) . 

Dr.  Deepak  has  already  described  some  of  the  features  of  this 
work  in  his  talk  on  the  Aureole  Isophote  Method  and  in  connection 
with  his  Stratospheric  Aerosol  Photographic  Experiment  (SAFE) 
proposal.  Let  me  add  some  words  here  oh  the  advantages  of  this 
type  of  experiment . 

In  Fig.  1,  we  show  the  densitometric  traces  of  a measured 
photographic  aureole  made  with  the  obscuring  disc  technique.  The 
traces  labeled  r and  1 are  through  the  solar  almucantar  whereas 
the  traces  labeled  b and  t are  below  and  above  the  sun  in  the  solar 
meridian.  The  beauty  of  this  experiment  is  the  reference  intensity 
provided  by  the  solar  disc,  so  in  essence  we  have  on  our  measuring 
medium  a comparison  between  diffuse  sky  intensity  and  the  direct 
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1.  Densltometric  trace  of  aureole  photograph  with  obscuring  disc  for  horizontal  and  verti- 
with  I,  r,  t,  and  b denoting  positions  of  scans  on  the  left,  right/  top,  and  bottom  sides 


Fig.  2a.  Examples  of  oversize  distribution. 

Fig.  2b.  Corresponding  examples  of  distribution. 

4 

solar  intensity,  attenuated  by  10  by  a Neutral  Densxty  4 (ND4) 
filter.  Thus  far,  in  our  work  at  the  University  of  Florida,  we 
have  only  exploited  the  information  content  in  the  almucantar 
trace  of  the  solar  aureole. 

For  the  convenience  of  analysis,  we  use  an  analytic  size 
distribution  characterized  by  the  cumulative  distribution  function 
(see  Fig.  2) . 
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(a)  V = 3. 

Fig.  3.  Phase  functions  for  model  distribution. 


N(r)  = / [1  + (r/a)'*] 


(1) 


where  v and  a are  adjustable  parameters, 

the  differential  size  distribution 

r . „ V 1 


[1  + (r/a)^]^ 


This  corresponds  to 


(2) 


This  analytic  size  distribution  is  a generalization  of  the  Junge 
power  law  which  is  well  behaved  as  r ^ 0. 
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We  have  calculated  a library  of  normalized  phase  function  for 
this  tvro^parameter  size  distribution  function.  Figure  3 illus- 
trates several  examples . Figure  4 illustrates  some  recent  work 
at  640  nm  showing  the  sharpness  in  the  determination  of  V obtained 
by  this  type  of  analysis  (Ref.  14) . 

Spectral  turbidity  measurements,  such  as  determined  by  a 
multi-channel  Sun  photometer,  also  give  infoinnation  about  the 
particle  size  distribution  (Ref.  15) . Green  and  Sawada  (Ref.  16) 
have  determined  the  relative  spectral  turbidities  associated  with 


(b)  V = 5. 

Fig . 3 . Concl uded . 
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Fig.  4.  Measured  aureole  intensity  along  the  almucantar 
compared  with  calculations  for  three  v variations  in  the  aerosol 
size  distribution.  (From  Ref.  14.) 


aerosol  size  distributions  characterized  by  Eqs.  (1)  and  (2) . 

Thus,  aureole  data  augmented  by  spectral  turbidity  data  can  lead 
to  a very  sharp  determination  of  v,  as  well  as  N^- 

We  have  considered  the  problem  of  multiple  scattering  in  the 
solar  aureole  (Ref.  17) , particularly  in  the  ultraviolet  where 
Rayleigh  scattering  becomes  so  important.  Figure  5 shows  an  inter- 
comparison of  three  calculations  (Ref.  18) : (1)  a single  scat- 

tering treatment,  (2)  a multiple  scattering  calculation  using  Monte 
Carlo  techniques,  and  (3)  a multi-channel  calculation.  The  cal- 
culation indicates  that  multiple  scattering  can  be  cjuite  signifi- 
cant, particularly  in  the  ultraviolet. 

In  subsequent  work,  McPeters  and  Green  (Ref.  14)  have  found 
that  Rayleigh  scattering  is  the  source  of  most  of  this  multiple 
scattering  in  the  solar  aureole  except  at  high  particulate  optical 
depths.  Accordingly,  they  have  proposed  an  analysis  technique 
which  uses  single  aerosol  and  Rayleigh  scattering  augmented  by 
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Fig.  5.  Absolute  intensity  as  a function  of  detector  zenith 
angle  for  two  different  Mie  scattering  functions  as  calculated  by 
three  separate  methods . The  solid  lines  represent  the  results  of 
the  multi-channel  calculations,  the  dots  are  the  Monte  Carlo 
results,  and  the  dashed  curves  are  the  single  scattering  calculations. 
Note  the  break  in  the  scale  to  avoid  superposition  of  the  two  sets 
of  results.  (From  Ref.  IS.) 
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multiple  Rayleigh  scattering  as  determined  by  the  use  of  the  tables 
of  Coulson,  Dave, and  Sekera  (CDS)  (Ref.  19).  Figure  6 is  an  illus- 
tration of  a fit  to  data  based  upon  such  an  analysis. 


III.  DIFFUSE  TO  DIRECT  RATIO  MEASUREMENTS 

In  connection  with  our  ultraviolet  studies  in  support  of  the 
Climatic  Impact  Assessment  Program  (Ref.  20),  we  have  attempted  to 
realistically  characterize  the  diffuse  solar  radiation  or  sky 
radiation  in  the  ultraviolet.  As  it  turns  out,  sky  radiation  in 
the  ultraviolet  is  often  of  greater  biological  consequence  than 
direct  sunlight.  Green,  Sawada  and  Shettle  (Ref.  21)  have  devel- 
oped an  approximate  analytic  formula  which  describes  the  diffuse 
spectral  ir radiance  in  the  ultraviolet  region  by  adapting  a single 
scattering  analysis  to  the  systematics  of  Bener*s  experiments 
(Ref.  22)  and  to  theoretical  calculations  of  Shettle  and  Green 
(Ref-  23) 


0®  5®  10® 

Scattering  Angle 


Fig.  6.  Normalized  ultraviolet  aureole  intensities  along  the 
almucantar  compared  with  modified  single  scattering  calculation, 
with  and  without  the  CDS  Rayleigh  multiple  scattering  correction 
factor,  and  compared  with  a Monte  Carlo  full  multiple  scattering 
calculation.  (From  Ref.  14.) 
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More  recently,  Chai  and  Green  (Ref.  24)  have  recognized  the 
merits  of  measurements  of  the  ratio  of  diffuse  to  direct  spectral 
irradiance  as  a simple  indicator  of  atmospheric  optical  properties. 
The  ratio  method  is  analogous  in  some  respects  to  our  aureole 
method  except  that  there  we  must  attenuate  the  direct  solar  inten* 
sity  by  four  orders  of  magnitude  to  compare  it  to  the  sky  intensity. 
Figure  7 is  an  example  of  measurements  of  the  total  sky  irradiance, 
the  direct  irradiancef  and  the  ratio.  The  diffuse  and  direct  both 
vary  very  markedly,  and  reflect  the  fluctuations  in  the  extra- 
terrestrial solar  spectral  irradiance  and  in  the  ozone  extinction 
coefficients  as  well  as  in  the  wavelength  dependence  of  instrumental 
sensitivity.  However,  the  diffuse  to  direct  ratio  is  only  slowly 
varying  but  still  sensitively  depends  upon  such  interesting  charac- 
teristics as  atmospheric  particulate  loading,  ground  albedo,  and 
sky  cover.  This  ratio  method  avoids  problems  associated  with  the 
difficulty  of  absolute  spectral  irradiance  measurements  which,  at 
this  time,  are  limited  to  about  8%  in  the  ultraviolet  region. 

It  should  be  remarked  that  Herman,  et  al.  (Ref.  25)  have 
theoretically  examined  such  a ratio  method  in  the  visible  region 
in  an  attempt  to  estimate  the  imaginary  part  of  the  index  of 
refraction  of  atmospheric  aerosols. 

We  shall  next  consider  another  example  of  inverting  of  optical 
data  with  the  aid  of  the  analytic  modeling  approach  and  auxiliary 
information,  such  as  may  be  obtained  with  simple  ground-based 
inst2mments. 

IV.  THE  INVERSION  OF  THE  LOW  ALTITUDE  OZONE  PROFILE 

Tropospheric  ozone  is  a constantly  varying  atmospheric  com- 
ponent which  changes  with  the  season  of  the  year,  location,  and 
time  of  day.  Both  photochemical  and  stratospheric  transport  pro- 
cesses are  important  sources  of  tropos;^heric  ozone  and  are  respond 
sible  for  the  strong  diurnal , seasonal, and  spatial  variations . 
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Fig.  7.  Total,  diffuse,  direct,  and  the  ratio  of  diffuse  to 
direct  components  of  solar  irradiances  as  a function  of  wavelength 
as  recorded  on  31  July  1975  at  around  13:50  EST.  The  magnitudes 
of  the  irradiances  are  in  relative  scale;  they  are  proportional  to 
the  photomultiplier  output  in  100  yy.  (From  Ref.  24..) 


A systematic  program  of  balloon-borne  ozonesonde  observations 
has  provided  valuable  data  on  the  altitude  structure  of  ozone  which 
can  be  used  as  a guide  in  constructing  a versatile  analytic  model 
(Ref.  26) . We  consider  in  what  follows  the  possibility  of  infer- 
ring the  tropospheric  ozone  profile  from  diffuse  to  direct  ratio 
measurements  in  the  middle  ultraviolet  in  conjunction  with  ozone 
point-sampling  at  the  ground.  We  model  the  altitude  profiles  of 
the  atmospheric  components  with  analytic  functions  because  such 
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a technique  simplifies  data  inversion  and  provides  a convenient 
way  of  communicating  the  resulting  profiles. 


Green  (Ref.  8)  has  modeled  the  stratospheric  ozone  coliomn 
density  as  a function  of  altitude  y with  a distribution  used 
extensively  in  the  nuclear  studies  (the  so-called  Wood-Saxon 
function) (Refs.  3 and  4) 


w(y) 


w^(l  + e ^o/h) 
1 + 


(3) 


Here  w^  is  the  total  ozone  thickness  and  y^  and  h are  parameters. 
The  density  profile  p (y)  is  given  by 
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The  parameter  y^  is  the  altitude  at  which  the  density  function 

peaks  and  h scales  the  width  of  the  distribution.  Green  (Ref.  2) 

shows  how  y^,  h and  w^  can  be  approximately  inferred  from  solar 

backscatter  ultraviolet  (UV)  measurements.  In  their  recent 

analysis  on  ground  level  UV,  Shettle  and  Green  (Ref.  23)  add  an 

exponential  term  to  this  function  to  allow  for  the  tropospheric 

ozone  component.  Here  to  characterize  tropospheric  ozone  profiles 

which  are  concave,  i.e.,  the  density  decreases  with  altitude  above 

the  ground  and  then  increasing  at  the  tropopause,  we  add  a second 

teirm  of  the  form  of  Eq.  (3)  or  (4)  with  the  parameters  y^*  and  h’ 

and  where  w is  now  the  sum  of  w + w * . 

T o o ■ 

In  Fig.  8,  several  profiles  corresponding  to  various  values 

of  h*  and  p(0)  are  shown.  Here  y * = 0 and  w =0.29  atm-cm;  and 

o o 

y^  and  h are  set  to  23  km  and  4 km,  respectively.  Extreme  con- 
cave profiles  can  be  obtained  as  well  as  curves  of  almost  constant 
density.  Furthermore,  convex  profiles  can  be  generated  with  two 
distributions  by  setting  y^*  to  be  a positive  nxmber.  Convex  pro- 
files are  observed  with  greatest  frequency  in  the  simimer  months  at 
latitudes  above  40^N. 
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Fig.  8,  Ozone  density  profiles  for  a number  of  different 
values  of  the  parameter  and  ground  density ^ with  y^  = 23  km, 

h = 4 km  and  y =0. 


Assuming  we  know  the  ground  level  ozone  density  and  also  that 

the  ozone  profile  can  be  represented  by  a two  component  model  with 

y*  = 0,  we  can  test  the  sensitivity  of  the  diffuse  to  direct  ratio 

to  the  parameter  h* . In  Pig.  9,  we  plot  the  ratio  versus  h*  for 

p(0)  = 40,  70,  100  (yg/m^)  for  a wavelength  of  300  nm.  We  take 

y = 23  km  and  h =4  km.  Furthermore,  we  have  assumed  the  air 
o o 

and  aerosol  profiles  of  Shettle  and  Green  (Ref-  23)  which  are  based 
on  Elterman's  1964  data  (Ref.  27).  The  total  aerosol  optical 
depth  is  0-411;  the  aerosol  is  characterized  by  the  cumulative 
size  distribution  given  by  Eq.  (1)  with  v = 3 and  a =0.03  ym;  the 
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Fig.  9.  Diffuse  to  direct  ratio  dependence  on  the  scale 

q 

parameter  h’  for  ground  ozone  densities  of  40,  70, and  100  yg/in  . 

aerosol  index  of  refraction  is  1.5  + .Oli;  a ground  albedo  of  0.05 
is  assumed;  and  the  Sun  is  directly  overhead.  The  ratios  are  cal- 
culated by  using  the  multiple  scattering  technique  of  Shettle  and 
Green  (Ref.  23) . 

The  diffuse  to  direct  ratio  then  can  be  used  to  infer  a value 
of  h*  from  a set  of  curves  as  in  Fig.  9 when  used  with  point 
sampling  measurements  which  give  the  ozone  cone  entreat  ion  at  the 
ground. 

Other  simple  and  inexpensive  measurements  can  be  made  simul^ 
taneously  with  the  ratio  and  point- sampling  measurements  in  order 
to  further  delimit  the  inversion  and  reduce  the  uncertainties.  For 
example,  aureole  photography  can  be  employed  to  infer  the  aerosol 
size  distribution  parameters  (Refs.  10,  12,  and  14) . Also, 
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multi-wavelength  photometry  provides  valuable  infomation  on  the 
aerosol  optical  thickness  and  its  wavelength  dependence  (Refs.  15 
and  16) . 

The  two  most  nebulous  parameters  are  the  aerosol  single 
scatter  albedo  and  the  ground  albedo.  It  is  important  to  know  how 
uncertainties  in  these  parameters  propagate  through  the  inversion. 
The  aerosol  single  scatter  albedo  will  depend  on  the  aerosol  index 
of  refraction  and  size  distribution.  By  using  aureole  photography 
to  pin  down  the  size  distribution,  one  can  draw  on  experience  from 
previous  investigations  to  fix  the  aerosol  index  of  refraction 
within  a range  of  confidence.  For  example,  bistatic  laser  and 
aureole  photography  methods  (Ref.  28)  indicate  that  the  index  of 
refraction  of  a typical  Gainesville  aerosol  is  0.005  ± 0.005  for 
the  imaginary  part  and  1.50  ± 0.05  for  the  real  part.  The  diffuse 
to  direct  ratio  is  found  to  be  relatively  insensitive  to  this  range 
of  possible  error.  Similar  considerations  apply  to  the  ground 
albedo.  In  the  theoretical  model  a ground  albedo  of  0.05  was 
assumed  which  is  compatible  with  measurements  by  Furukawa  and  Heath 
(unpublished  reports,  1973)of  various  natural  surfaces  for  the  wave- 
length region  310  to  380  nm.  For  example,  they  found  that  for  scrub 
desert  the  ground  albedo  was  0.04  over  the  310  to  380  nm  region. 

For  farmland,  70%  tilled  and  30%  covered  with  vegetation,  the  ground 
albedo  was  found  to  be  0.07  to  0.08  for  the  310  to  340  nm  region. 

Small  errors  in  the  ground  albedo  (~0.02)  do  not  significantly 
affect  the  calculations.  Furthermore,  once  the  ground  albedo  is 
known  for  a given  location,  the  daily  variations  of  h*  can  be 
detemined,  unless,  of  course,  the  surface  changes  because  of  snow 
cover,  cultivation,  or  the  like. 

If  the  true  optical  depth  is  used  in  the  inversion,  then 
underestimating  the  ground  albedo  or  the  aerosol  single-scatter 
albedo  will  lead  to  calculated  ratios  which  are  too  small. 
Overestimation  will  lead  to  calculated  ratios  which  are  too  large. 
Diffuse  to  direct  ratio  measurements  in  the  320  to  340  nm  region 
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can  be  used  to  infer  effective  aerosol  optical  depths  here  since 
in  this  region  the  ratios  are  rather  insensitive  to  tropospheric 
ozone.  The  effective  optical  depths  Ccin  be  extrapolated  to  smaller 
wavelengths.  These  effective  optical  depths  will  be  somewhat  dif- 
ferent from  the  true  aerosol  optical  depths  and  will  tend  to  com- 
pensate for  errors  in  the  ground  albedo  and  aerosol  single- 
scatter albedo.  The  diffuse  to  direct  ratio  around  300  to  320  nm 
is  insensitive  to  the  altitude  distribution  of  the  aerosols  so 
long  as  the  ozone  profile  near  the  ground  is  not  changing  too 
rapidly.  Also,  detailed  knowledge  of  the  stratospheric  ozone 
structure  or  thickness  is  not  required - 

V.  CONCLUDING  REMARKS 

Strictly  numerical  methods  of  inversion  are  becoming  pre- 
dominant in  remote  sensing  these  days.  These  are,  of  course, 
valuable  to  infer  the  irregularities  and  statistical  fluctuations 
in  atmospheric  properties.  The  analytic  model  method  which  we 
have  illustrated  can  be  a valuable  supplement  to  such  niimerical 
methods.  They  are  particularly  useful  when  used  in  conjunction 
with  dynamical  models  of  atmosphere  structure  because  of  the 
additional  physical  input  of  such  models.  When  the  models  are 
joined  to  ground-based  point  sampling  data,  this  remote-sensing- 
analytic  model  approach  gives  approximate  answers  to  important 
questions  involved  in  many  public  pplicy  decisions  on  atmospheric 
pollution. 


SYMBOL 

a size  distribution  parameter,  corresponds  approximately  to 

the  size  of  particle  where  n(r)  peaks 
h ozone  distribution  parameter  which  is  proportional  to 

width  of  ozone  density  function;  prime  cjenotes  another 
value  of  h 

m real  part  of  aerosol  refractive  index 
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I absolute  scattered  intensity 

s 

n(r)  differential  aerosol  size  distribution 

N(r)  cumulative  aerosol  size  distribution 

size  distribution  parameter  which  is  equal  to  total  num- 
ber of  aerosol  particles 
r aerosol  particle  radius 

w total  ozone  thickness;  prime  denotes  another  value  of  w 

w(y)  the  ozone  thickness  function 

y altitude 

y^  altitude  at  which  ozone  density  peaks;  prime  denotes 

another  value  of  y 

o 

ZA  zenith  angle 

0 scattering  angle 

V size  distribution  parameter  which  determines  power  law 

dependence  of  n(r)  at  large  r 
p (y)  ozone  density  function 

T optical  depth 
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DISCUSSIONS 


Delrmendjian:  With  regard  to  the  importance  of  multiple  scattering 

in  the  aureole,  I don't  want  to  blow  my  horn,  but  I certainly  wish 
to  protect  Professor  Sekera's  memory.  In  my  1956  doctoral  thesis, 

I looked  at  the  problem  of  the  solar  aureole  at  his  suggestion. 

The  method  used  was  essentially  Sekera's  idea  and  consisted  of 
treating  the  aureole  as  a perturbation  on  the  Rayleigh  multiple 
scattered  skylight.  And  that  is  precisely  what  I did.  Have  you 
seen  my  original  1957  paper  on  the  aureole? 

Green:  Yes,  we  have  seen  your  paper. 

Delrmendjian:  It  was  exactly  that.  You  increase  the  optical 

thickness  of  the  Rayleigh  atmosphere  by  adding  a perturbation 
optical  thickness  due  to  the  aerosols.  When  you  do  that,  it  is  a 
kind  of  hybrid  method  where  single  scattering  on  the  aerosols  pro- 
duces the  aureole,  but  multiple  scattering,  mainly  on  the  Rayleigh 
particles,  produces  the  rest  of  the  background  skylight.  I thought 
you  didn't  make  that  clear. 

Green:  We  weren't  aware  of  that  aspect  of  your  paper  back  then. 

We  were  aware  of  your  work  on  the  aureole  and  we  have  quoted  it  in 
our  work. 

Delrmendjian:  I think  that  was  the  principal  point.  I have  since 

reexamined  this  method  in  1970.  I did  not  publish  the  results  -in 
the  open  literature  due  to  lack  of  funds.  But  they  are  available 
in  a formal  1970  Rand  Corporation  report,  in  which  I introduced 
new  phase  functions.  Indeed,  the  curves  that  are  obtained  look 
very  much  like  some  of  the  measurements  I have  seen.  Subsequently, 
I intended  to  compare  them  with  the  measurements  and  look  into 
their  use  to  get  information  about  the  aerosol  size  distribution. 

At  the  time,  I was  unable  to  do  this  for  lack  of  support. 

Green:  Well,  there  was  no  intention  to  slight  you  or  Professor 
Sekera.  We  have  in  our  work  in  this  area  acknowledged  this.  The 
UV  problem  has  a new  interest  in  light  of  its  biological  aspects. 

So  part  of  this  work  was  directed  toward  answering  some  particular 
questions  about  the  radiance  of  the  UV  aureole  where  some  serious 
problems  remained.  My  point  about  the  low  altitude  ozone  distri- 
bution is  that  you  can  take  advantage  of  the  extra  scattering 
associated  with  aerosols  to  extend  the  path  and  the  sibsorption  in 
the  low  altitude  ozone  layer.  Thus,  you  can  actually  drag  out  a 
little  information  about  the  high  altitude  distribution,  but  little 
about  the  low  altitude  profile. 

Weinman:  While  I agree  with  the  previous  speaker  that  the  aureole 

technique  is  a powerful  one  for  determining  size  distributions,  I 
should  point  out  that  invisible  cirrus  clouds  can  plague  this 
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technique.  While  there  are  frequent  cases  where  you  can  get  nice 
smooth  functions  to  which  you  can  apply  the  theory,  this  hazard 
always  exists-  If  one  looks  at  this  data,  one  can  pick  up  cirrus 
clouds  which  are  not  at  all  visible  to  the  naked  eye.  I think 
that  it  takes  a certain  amount  of  judicious  discrimination  to  get 
cases  to  which  you  can  apply  the  radiative  transfer  theory. 

Green:  I think  Adarsh  has  shown  some  results  where  contours'  of 

isodensity  on  a film  show  distortions  that  are  suggestive  of  a 
thin  cloud  layer.  In  some  of  our  work,  we  have  noted  some  problems 
of  that  nature,  which  one  can,  perhaps,  use  as  information  content. 

Deepak:  Yes,  we  have  been  looking  at  the  shapes  of  the  solar 

aureole  isophotes  rather  carefully  and  we  see  small  bumps.  The 
isophote  curves  are  not  very  smooth.  That  could  be  due  to  the 
presence  of  thin  clouds.  In  fact,  we  have  an  airport  nearby; 
when  aircraft  take  off  we  can  easily  detect  the  presence  of  con- 
trails from  the  systematic  distortion  in  the  shape  of  the  isophotes 
over  the  region  of  the  contrail  image. 

Fymat:  I was  very  interested  in  your  conclusion  in  determining  the 

scale  height  for  the  troposphere. 

Green:  For  the  tropospheric  ozone  distribution. 

Fymat:  Yes,  but  you  need,  as  you  say,  the  ozone  distribution  at 

the  lower  altitudes.  However,  it  seemed  to  me,  and  Dr.  Mateer  may 
care  to  comment  on  it,  that  the  actual  ground-based  Dobson  measure- 
ments are  really  being  phased  out  in  view  of  the  fact  that  they 
produce  different  results  on  inversion  using  different  methods. 

So,  if  we  cannot  get  the  distribution  at  the  lower  altitudes,  how 
practical  is  your  conclusion^ 

Green:  Well,  insofar  as  profile,  the  Umkehr  method  (not  the  Dobson 
method)  relies,  and  Carl  can  correct  me  since,Jie  is  much  more 
expert,  on  the  setting  of  the  sun  and  inverting  sky  radiances.  Our 
method  could  be  worked  at  high  noon  since  it  is  instantaneous.  We 
would  use  wavelength  information  and  we  have,  in  effect,  shown' by 
simulated  tests  that  the  diffuse  to  direct  ratio  is  only  sensitive 
to  the  addition  of  extra  ozone  below  about  10  kilometers.  So  that 
while  it  is  not  a complete  profile,  if  you  use'*^t  in  conjunction 
with  ground  level  chemical  measurement,  it  gives  you  information 
which  relates  to  the  lower  atmospheric  ozone  and  it  is  insensitive 
to  the  stratospheric  ozone  which  is  far  more  abundant.  Now  am  I 
correct  that  the  Umkehr  method  more  or  less  gives  you  the  most 
nformation  about  the  higher  atmospheric  ozone?  So  there  is  no 
^.al  contradiction  with  anything  previously  known. 
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Mateer:  The  Umkehr  measurement  is  a different  kind  of  measurement* 
It  is  a ratio  measurement  of  two  wavelengths  •'  as  opposed  to  what  you 
are  doing — a ratio  of  diffuse  to  direct  sunlight. 

Green:  Yes-  We  also  are  helped  if  we  use  two  wavelengths  as  well* 
It  eliminates  a little  bit  of  the  absolute  radiometry  problem.  It 
gives  us  another  check. 
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OPEN  DISCUSSIONS  I 


King:  Alex,  what  do  you  think  we  can  learn  about  solving  the 

atmospheric  temperature  inversion  problem  from  the  similar  type 
inversion  problem  in  inferring  nuclear  potential  in  nuclear  physics? 

Green:  Well,  I was  discussing  this  type  of  thing  with  Dr,  Wark, 

and  the  analogy  would  be  if  we  brought  to  bear  to  the  problem  all 
the  physical  dynamics  data,  not  only  at  one  location  but  at  adja- 
cent locations.  Thus,  if  we  had  relatively  few  questions  that  we 
posed  to  the  radiative  transfer  problem,  for  example,  to  get  hori- 
zontal gradients  in  temperature  profile,  and  perhaps  include  such 
facts  that  there  tends  to  be  a tropopause  and  we  are  mainly  inter- 
ested where  the  temperature  break  occurs,  and  ask  questions  which 
embody  all  of  our  experience  as  to  reasonable  temperature  profiles. 
Then  when  we  apply  this  external  information  to  an  inversion  prob- 
lem, it  is  usually  much  easier  to  come  to  a physically  reasonable 
answer.  Now  I don't  know  if  that's  a good  paraphrase  of  the 
analogy.  We  do  have  one  problem  which  I have  to  confess  compromises 
the  analogy.  In  the  case  of  the  nuclear  force  problem  we  ifhink  all 
neutrons  and  protons  are  the  same,  and  unfortunately  the  atmosphere 
takes  on  so  many  different  states  that  we  do  have  that  extra  com- 
plication. Thus,  if  you  tried  to  unfold  many,  many  details  of  the 
instantaneous  atmosphere,  you  may  be  in  trouble  with  this  approach. 
On  the  other  hand,  if  you  are  satisfied  with  answering  the  types  of 
physical  questions  that  our  dynamical  meteorologists,  pollution 
experts,  or  biologists  want  answered,  I think  you  get  a lot  of ‘use- 
ful information  by  these  modeling  approaches. 

Herman:  Alex,  I would  like  to  ask  you,  what  is  the  sensitivity  of 

the  direct  diffuse — insofar  as  solving  for  ozone — to  variations  in 
aerosol  contents?  Because  as  you  know  we  are  trying  to  apply  it  to 
learn  various  parameters  of  the  aerosol.  Now  you  are  using  it  to 
learn  about  the  ozone  but  it  still  has  a sensitivity  to  aerosols 
even  at  the  ultraviolet  wavelengths. 

Green:  Yes,  I should,  unless  I commit  another  error  of  omission, 

mention  that  Ben  has  used  the  direct-to-dif fuse  ratio  to  infer  the 
imaginary  part  of  the  index  of  refraction  working  at  500  nanometers. 
We  were  not  aware  of  this  work  when  we  started  ours,  but  were  aware 
of  it  during  its  course.  I think  the  trick  in  our  case  is  to 
choose  two  wavelengths — one  in  the  region  where  the  ozone  is 
absorbing  and  one  where  it  is  nonabsorbing-  Then  we  take  advantage 
of  whatever  we  know  about  our  aerosols,  including  a ground-level 
aerosol  measuring  device,  plus  an  aureole  measuring  device.  I 
would  not  go  to  anything  expensive  like  a lidar.  But  I think  if 
we  bound  our  aerosol  model  somewhat  we  find  by  sensitivity  analyses 
that  we  do  get  some  information  about  the  low  level  ozone.  This  is 
what  we  would  pick  up  by  using  two  wavelengths,  just  as  Dobson  does. 
We  would  go  to  about  320  nanometers  and  then  to  305  nanometers.  We 
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would  expect  that  the  aerosols  do  not  change  very  much,  so  that  we 
hold  the  aerosols  confusion  factor  down. 

Goulard:  I am  trying  to  apply  inversion  techniques  to  the  field 

of  combustion.  I get  the  profiles  of  temperatiire  or  concentrations 
following  the  methods  that  Dr.  Chahine  has  developed.  But,  I get 
them  in  terms  of  optical  thickness.  And  when  I want  to  convert  it 
to  physical  thickness,  which  is  what  the  combustion  people  want,  I 
find  that  I cannot  depend  on  the  constant  concentration  of  CO2  as 
you  do  in  the  atmosphere.  Could  someone  give  me  some  help  on  how 
to  get  around  this  problem  in  frequency  scanning  only? 

Chahine:  Let  me  answer  Professor  Goulard's  question  on  the  solu- 

tion in  terms  of  the  optical  depth.  You  know,  the  independent 
variable  in  the  radiative  transfer  equation  is  x but  we  set  dx  = 
(9x/9z)dz  in  the  equation  and  obtain  a solution  as  a function  of 
the  physical  scale  z.  As  you  know  3x/9z  is  obtained  on  the  basis  of 
an  atmospheric  model.  Thus,  the  solution  is  truly  a function  of. 

X although  it  is  usually  presented  as  a function  of  z.  The  trans- 
formation from  the  x-scale  to  the  z-scale  is  done  through  a model. 

Goulard:  You  assume  Laplace  Law  of  the  atmosphere  and  a constant 

mixing  ratio? 

Chahine:  Yes,  we  have  to  do  this. 

Goulard:  And  for  a flame? 

Chahine:  You  would  have  to  develop  a model  for  your  flame  and 

correlate  your  optical  depths  with  a physical  scale. 

Goulard:  Is  the  concentration  of  CO2  really  constant  throughout 

the  atmosphere?  ^ 

Green:  Well,  it  is,  yes,  within  a small  period  of  time.  There 

are  some  people  who  think  it  is  growing. 

Gille:  The  concentration  of  CO2  in  the  troposphere  is  quite  con- 

stant although  not  absolutely  so.  It  varies  seasonally  by  perhaps 
one  part  per  million  out  of  a background  of  330  with  a long-term 
trend  of,  perhaps,  0,7  ppm  per  year. 
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COMPARISON  OF  LINEAR  INVERSION  METHODS  BY 


EXAMINATION  OF  THE  DUALITY  BETWEEN 
ITERATIVE  AND  INVERSE 
MATRIX  METHODS 


Henry  E.  Fleming 

National  Environmental  Satellite  Servicer  NOAA 


Linear  numerical  inversion  methods  applied  to  atmospheric 
remote  sounding  generally  can  be  categorized  in  two  ways: 

(1)  iterative r and  (2)  inverse  matrix  methods.  However,  these 
two  categories  are  not  unrelated;  a duality  exists  between 
them.  In  other  words,  given  an  iterative  scheme,  a corre- 
sponding inverse  matrix  method  exists,  and  conversely.  This 
duality  concept  is  developed  for  the  more  familiar  linear 
methods.  The  iterative  duals  are  compared  with  the  classical 
linear  iterative  approaches  and  their  differences  analyzed. 

The  importance  of  the  initial  profile  in  all  methods  is 
stressed.  Calculations  using  simulated  data  are  made  to  com- 
pare accuracies  and  to  examine  the  dependence  of  the  solution 
on  the  initial  profile. 


I • INTRODUCTION 

When  working  with  different  linear  inversion  methods,  one 
finds  that  some  of  the  iterative  and  inverse  matrix  methods  yield 
similar  accuracies.  Therefore,  one  might  conjecture  that  these 
methods  are  simply  duals  of  each  other,  that  is,  given  am  itera- 
tive method,  there  exists  a corresponding  inverse  matrix  method 
that  yields  the  same  answer,  emd  conversely.  It  turns  out  that 
this  is  true  for  a significant  class  of  linear  inversion  methods. 
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Furthermore,  the  iterative  methods  are  not  quite  the  clas- 
sical ones  in  that  they  are  not  iterated  to  convergence.  Instead, 
they  are  iterated  only  until  the  residual  variances  are  equal  to 
the  variance  of  the  instrumental  noise.  This  leads  to  solutions 
that  are  different  from  the  classical  ones.  Consequently,  the 
duality  between  methods  is  not  exact,  that  is,  the  solutions  to 
the  dual  equations  agree  only  to  within -a  certain  degree  of 
accuracy.  This  requires  us  to  speak  of  "virtual  duality"  instead 
of  duality  in  the  exact  sense. 

The  general  outline  of  the  paper  is  to  first  develop  the 
duality  principal  for  the  least  squares  solution.  For  this  case 
the  duality  is  exact,  but  this  solution  is  unstable.  Hence,  we 
must  resort  to  regularized  solutions  for  which  we  can  have  only 
virtual  duality. 

The  principle  of  virtual  duality  first  is  established  for 
the  Twomey-Phillips  solution.  Then,  the  results  are  extended  to 
a more  general  class  of  solutions.  Finally,  the  question  of  the 
dependence  of  the  solution  on  the  initial  profile  is  addressed. 
Results  of  calculations  using  simulated  data  are  given  at  appro- 
priate places  throughout  the  paper. 

II.  THE  DUALITY  PRINCIPLE 

A.  The  Least  Squares  Solution 

Most  atmospheric  remote  sounding  problems  are,* or  can  be 
reduced  to,  a Fredholm  integral  equation  of  the  first  kind,  which 
can  be  approximated  numerically  by  a linear  system  of  equations 
(Ref.  1)  of  the  form 

A X = y (1) 

The  n X m (m  > n)  matrix  A is  the  matrix  of  weighting  functions, 

'Xi 

X is  the  m-dimensional  source-function  vector,  and  y is  the  vector 
of  n measured  values. 
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To  illustrate  the  duality  principle,  we  begin  with  the  least 
squares  solution  of  Eq.  (1) , namely# 


where  the  superscript  T denotes  the  transposed  matrix.  Of  the 
infinite  number  of  solutions  to  Eq.  (1) , the  solution  in  Eq.  (2) 
is  the  unique  solution  which  has  minimum  Euclidean  norm  and  exists 
because  the  weighting  functions  are  chosen  deliberately  to  be 
linearly  independent.  Since  A has  full  rank  (i.e.,  rank  n) , this 
solution  is  also  the  generalized  inverse  (or  Moore-Penrose  pseudo- 
inverse) solution  of  Eq.  (1) . 

On  the  other  hand,  the  classical  iterative  solution  (i.e., 
solution  by  successive  approximations)  to  Eq.  (1)  is  given  by 

x=x  +aA(y-Ax  ) (3) 

where  x is  the  kth  successive  approximation  to  x and  a > 0 is  a 
convergence  (relaxation)  factor.  We  say  that  Eq.  (3)  is  the  dual 
of  Eq.  (2) , and  conversely,  because  in  the  limit  Eq.  (3)  yields 
the  solution  in  Eq.  (2) , and  conversely.  This,  then,  is  an 
example  of  what  we  mean  by  the  duality  principle  between  inverse 
matrix  and  iterative  inversion  methods. 


To  show  that  Eq.  (3)  is  the  dual  of  Eq.  (2) , we  write  Bq.  (3) 
in  terms  of  the  initial  approximation  vector  x^  by  combining 
iterations  to  arrive  at  the  following  equations: 
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where  we  used  the  identity 


I + (I  - B)  + 

O/  Of  <\| 


+ (I  - B) 


k - 1 


B = I 


(I  - B) 


and  we  repeatedly  used  the  identity 

k y 

m T T T 

(I  - aA  A)  A = A (I  - aAA  ) 
0(111  o o o o on  oo 


(5) 


(6) 


and  where  denotes  the  ii-dimensional  identity  matrix.  Note  that 
a superscript  on  a vector  (a  lower  case  letter)  is  an  iteration 
index  while  a superscript  on  a matrix  (a  capital  letter)  repre- 
sents a power. 

Now  the  iterative  scheme  in  Eq.  (4)  converges  if  the  Euclidean 
noicm  of  the  matrix  difference  with  the  exponent  k in  the  last  line 
is  less  than  unity.  Since 


II  T II 

a II  A A 

II  I II 

< I 

1 I - 

aA*^A  11 

< 1 

o o 

' OTTI 

o o 

and  since  ||  I ||  = 1,  we  have 

oTtl 

0 < all  a'^A  II  <2. 

o o 

Under  the  condition  that  a < 2/I|a'^a||  , Eq.  (4)  becomes 

o o 

k T T ^ 

lim  X = A (AA  ) y = x.  ^ 

~ o oo 

k ^ ® 

Since  this  solution  agrees  with  the  solution  ifl  Eq.  (2),  we  have 
established  the  exact  duality  between  Eqs.  (2)  and  (3). 

While  Eqs.  (2)  and  (3)  are  an  example  of  the  duality  prin- 
ciple, in  practice,  neither  solution  is  of  much.vivalue  because  the 
T 

matrix  AA  is  generally  ill-conditioned  with  respect  to  matrix 
oo 

inversion.  The  ill-conditioning  arises  from  the  fact  that 

typically  the  weighting  functions,  which  are  the  rows  of  A,  are 

o 

very  smooth  and  broad,  and,  hence,  overlapping.  Furthermore,  the 
vector  y is  always  contaminated  by  measurement  noise  which  is 
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T 

amplified  by  (AA  ) . Consequently,  the  solutions  in  Eqs.  (2)  and 

(3)  are  unstable  and,  hence,  of  no  use  in  remote  sounding  appli- 
cations. 

Figure  1 shows  just  how  unstable  least  squares  (generalized 
inverse)  solutions  can  be.  It  is  based  on  139  radiosonde  profiles 
from  March  1973  which  were  almost  uniformly  distributed  between 


Fig.  i.  Profile  of  the  rms  errors  between  the  least  squares 
solutions  and  radiosondes  for  139  simulated  cases.  (1  bar  = 100  kPa.) 
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the  equator  and  70^  N.  Clear  radiances  were  calculated  for  the 
six  CO^  Vertical  Temperature  Profile  Radiometer  (VTPR)  channels 
and  were  contaminated  with  noise  by  a random  number  generator 
using  a normal  distribution  with  a standard  deviation  of  0.25  mW/ 
(m^  sr  cm  ^),  These  radiances  were  then  used  in  the  least  squares 
retrievals.  The  curve  in  Fig.  1 represents  the  139  rms  tempera- 
ture differences  (at  the  15  mandatory  pressure  levels  indicated) 
between  the  retrieved  profiles  and  the  radiosonde  profiles  (the 
"truth").  Clearly,  these  rms  errors  are  unacceptably  large — they 
vary  from  around  3°C  at  three  levels  to  19.2^C  at  1000  mb.  (1  bar 
= 100  kPa.)  To  obtain  stable  and  physically  realistic  solutions 
to  Eq.  (1) , one  must  resort  to  regularization  methods  which  sta- 
bilize the  ill-conditioned  matrix  that  is  to  be  inverted. 


B . The  Twomey-Phillips  Solution 

The  first  regularized  solution  of  Eq.  (1)  that  we  will  con- 
sider is  that  of  Twomey  and  Phillips  (Ref.  2) , which  is  also  known 
as  the  "minimum  information  solution"  (Ref.  1).  Regularization 
methods  are  designed  specifically  to  solve  Eq.  (1)  in  the  form 

Ax  = y + e (7) 

where  y is  still  the  n-dimensional  vector  of  measured  quantities, 
but  e is  the  n-dimensional  vector  of  measurement  errors,  so  that 
y + e would  be  the  errorless  vector.  The  Twomey-Phillips  solution 
of  Eq.  (7)  is 

X = x°  + A*^(AA^  + yl  ) (y  ■"  Ax^)  (8) 

o 

where  x is  an  initial  approximation  to  the  solution  and  y is  the 
smoothing  (or  regularization)  parameter . 


We  must  now  establish  the  iterative  dual  to  Eq.  (8) , which 
requires  the  matrix  identity 
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This  identity  permits  us  to  write  Eq.  (8)  in  the  form 


X = 


(10) 


Proper  interpretation  of  this  equation  yields  the  required  iter- 
ation formula.  We  start  the  iterative  process  by  setting  x = 
on  the  right-hand  side  of  Eq.  (10),  that  is,  we  introduce  a one- 
step  lag  between  the  ^*s  on  the  left-  and  right-hand  sides  of  the 
equation.  Then  x°  and  x on  the  right-hand  side  are  updated  with 
the  last  approximation  of  x on  the  left.  If  a is  used  as  the  con- 
vergence factor,  the  following  iterative  equation  results: 


k k - 1 . . ,T,  ^ k - 1. 

X = X + a/Y  A (y  - Ax  ) (11) 

- - Of  Of- 

This  interpretation  of  Eq.  (10)  is  purely  heuristic,  but  it  will 
be  justified  in  the  next  section. 


Now,  the  convergent  solution  of  Eq.  (11)  is  not  the  dual  of 

the  solution  in  Eq.  (8) . To  see  this,  use  the  relationships  in 

T T 

Eq.  (4)  with  A replaced  everywhere  by  1/yA  . Then  for 

a < 2 y/||  a a II,  we  have 
lim  k 

X = A (AA  ) y, 

X 00  f\j 


(12) 


which  again  is  the  least  squares  solution  of  Eq.  (1)  and,  there- 
fore, not  the  dual  of  the  solution  in  Eq.  (8).  In  the  next  section 
the  procedxire  for  finding  the  virtual  dual  of  the  solution  in 
Eq.  (8)  is  described.  Recall  from  the  introduction  that  we  cannot 
have  exact  duality  for  regularized  solutions. 


C.  The  Virtual  Iterative  Dual 


In  Ref.  3,  it  is  shown  that  the  generalized  inverse  solution 

T 

of  Eq.  (1) , even  when  AA  is  singular,  can  be  obtained  from  the 
limiting  process 


X 


lim 
Y ^ 0 


-1 


Yi  ) 


(13) 


331 


Furthermore,  it  is  shown  that  by  aborting  the  passage  to  the  limit 
zero  at  that  value  of  y which  the  residual  ||  ^ - y j|  is 

equal  to  the  norm  of  the  random  error  vector  e of  y,  we  obtain 
the  Twomey-Phillips  solution. 

The  virtual  iterative  dual  to  the  solution,  in  Eq.  (8) , is 
found  in  an  analogous  way.  Instead  of  passing  to  the  limit  as  in 
Eq.  (12) , one  stops  the  iteration  process  at  that  step  for  which 
the  residual  i|  Ax^  - y |l  is  equal  to  the  norm  of  £ in  Eq.  (7)  . 

Just  as  the  amount  of  smoothing  of  the  solution  in  Eq.  (8)  is 
determined  by  how  soon  the  approach  of  y to  zero  in  Eq.  (13)  is 
aborted,  the  amount  of  smoothing  in  the  virtual  iterative  dual 
in  Eq.  (11)  is  determined  by  how  soon  the  iteration  process  is 
stopped. 

The  equation  that  ensues  when  the  iteration  of  Eq.  (11)  is 

T 

terminated  after  k steps  is  given  by  Eq.  ( 4)  with  A replaced 

T 

everywhere  by  1/yA  , namely, 

O; 

k TT”1  Tk  O 

X = A (AA  ) y + (I  - a/yA  A)  x - A (AA  ) y (14) 

~ ~ M^m  OrOj  L"'  ^ J 

To  prove  that  this  solution  is  the  virtual  dual  of  the  solution 

in  Eq.  (8) , we  must  show  that  for  some  finite  k , the  solution 

^o  ^o  ^ \ ^ 

X of  Eq.  (14)  is  such  that^  - x from  Eq.  (8)  , and  that 

II  Ax  ^ “ y 11  - II  £ ||.  The  symbol  is  to  be  read  as  "is 

virtually  equal  to",  whose  meaning  is  made  more  precise  in  the 

subsequent  discussion. 

III.  PROOF  OF  VIRTUAL  DUALITY 

V-fe 

A.  Eigenvalue  Decomposition 

We  begin  the  proof  of  virtual  duality  by  first  reducing » the 
dimensionality  of  matrices  from  m to  the  smaller  dimension  n 
where  necessary.  The  first  such  case  is  the  power  matrix  in 
Eq.  (14)  . Subtract  x*^  from  both  sides  of  Eq.  (14)  , apply  the 
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identity  in  Eq.  (5) , then  apply  the  identity  in  Eq.  (6) , and 
finally  use  the  identity  in  Eq.  (5)  again  to  obtain  the  forms 


T Ic 

a/YA  A) 

m T ^ 

A (AA^)  y 

o 

- X 

L o.m  O/m 

0/0/  J L 

0/  0.0/  ~ 

tT 

T ^ 

1 T , 

o 

- a/yAA  ) 

(AA  ) (y 

- Ax  ) 

0/  L 

r\/\, 

J “W 

0/^ 

(15) 


Now  to  prove  that  x - x,  we  make  the  following  substitu- 
tion in  Eq.  (15) : 


-1 

T k 

for  (I  “ a/vAA  ) and  show  that  this  does  indeed  transform  Eq.  (14) 
into  Eq.  (8) . Then  we  show  that  the  expressions  in  Eq-  (16)  are 
virtually  equal  for  the  iteration  index  k . Finally,  we  show  that 
the  index  k is  such  that  ||ax  "’yII  ~ ll^ll  Consequently,  the 
two  solutions  in  Eqs.  (14)  and  (8)  must  be  virtually  the  same  and 
the  virtual  duality  between  them  is  established. 

We  start  by  making  the  substitution,  of  Eq-  (16) , in  Eq.  (15) 
and  apply  the  identity 


I - (I  + B)"^  = (I  + B)  ^B  = B{I  + B)  ^ (17) 

to  arrive  at  the  Eq-  (8) . This  proves  that  if  the  substitution 

of  Eq.  (16)  is  made  in  Eq.  (15) , then  Eq-  (14)  is  indeed  the 

virtual  dual  of  the  solution  in  Eq.  (8)  for  k = k . 

o 

Next,  we  show  that  in  the  expression  in  Eq.  (16) , the  power 

matrix  is  virtually  equal  to  the  inverse  matrix.  To  do  this,  we 

T 

must  consider  the  diagonalization  of  the  symmetric  matrix  AA  . 

.p 

Associated  with  the  n x n matrix  AA  are  the  n x n orthogonal 
matrix  of  eigenvectors  U and  eigenvalues 
such  that 


AA 

0/0. 

T 

U U 

0/  O/ 


T 

UAU 

OAA/ 


0.0. 


I 

O/n 


(18a) 

(18b) 
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(18c) 


A = diag(A, , . . . , A ) 

^ n 

where  diag(  ) denotes  a diagonal  matrix. 

Now  apply  Eqs.  (18a)  and  (18b)  to  the  matrices  in  Eq.  (16). 
This  yields  the  decompositions 

(I  + 1/yAA^)  = U(I  + 1/yA)“^u'^  (19a) 

^n  O/b  f\j  r\j  r\j 


and 


(I„ 

^n 


k..T 


o/yAA  ) = U(I  - a/YA)"U 

<W;  'Xi  'Vin  r\,  r\j 


(19b) 


Thus,  the  whole  question  of  virtual  duality  between  Eqs.  (8)  and 
(11)  comes  down  to  comparing  a function  of  eigenvalues  of  the  form 


f^(z)  = (1  - az)^ 
with  one  of  the  form 


(20a) 


h(z)  = (1  + z) 


(20b) 


where  z = X/y  > 0. 


B.  Proof  that  the  Eigenvalues  are  Virtually  Equal 

Recall  that  there  are  several  constraints  on  the  problem . 

For  convergence  we  need  | 1 - az | < 1,  or  0 .<  az  < 2,  but  to  pre- 
vent alternating  signs  in  the  iteration  process,  we  use  the  fur- 
ther restriction  0 < az  ^ 1.  This  gives  us  the  inequalities 
0 ^ f ( z ) <1  and  0 < h ( z ) < 1 . 

To  see  that  h(z)  - f (z)  , consider  Fig.  2'  in  which  the 
curves  for  fj^(z)  with  k = 1,2,4,8,16,32,  and  128  are  plotted  for 
a = 0.01.  This  value  for  a is  typical.  The  dashed  curve  is  that 
of  h(z)  for  the  domain  0 ^ az  ^ 1,  that  is,  for  0 ^ z ^ 100.  Note 
that  the  variable  x in  Fig.  2 corresponds  to  z in  tfhe  text.. 

Figure  2 is  to  be  interpreted  in  the  following  manner.  Given 

T 

the  eigenvalues  of  the  operator  1/yAA  , which  are  located  on  the 
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Figure  2.  The  functions  of  eigenvalues  h(x)  (the  dashed 

curve)  and  f.(x)  (the  family  of  solid  curves)  show  how  close  the 

eigenvalues  are  for  the  case  a = O.Oi.  The  circles  are  the  six 

eigenvalues  of  the  inverse  matrix  corresponding  to  the  six  eigen 
T 

vectors  of  1/yAA  . (Note  that  the  variable  x in  the  figure 
corresponds  to.  the  variable  z in  the  text.)  (1  har  = 100  kPa.) 

x-axiSr  the  eigenvalues  of  the  inverse  operator,  in  Eq.  (19a), 
are  the  corresponding  points  on  the  dashed  curve.  Similarly, 
corresponding  to  these  same  eigenvalues  on  the  x-axis  are  the 
eigenvalues  of  the  approximating  operator  in  Eq.  (19b)  on  the 
family  of  solid  curves  for  the  various  iteration  indices  k*  Thui 


if  one  of  the  k-curves  in  Fig.  2 were  to  lie  everywhere  on  the 
dashed  curve,  that  would  be  the  curve  fj^^(z)  which  we  are  seeking, 
because  then  any  eigenvalue  of  the  one  operator  would  be  an  eigen- 
value of  the  other,  and  the  two  operators,  in  Eqs.  (19a)  and  (19b) 
would  be  identical — giving  us  the  duality  we  are  looking  for. 
However,  it  is  obvious  from  the  figure  that  we  cannot  have  perfect 
agreement  between  any  of  the  k-curves  and  the  dashed  curve. 

Instead,  there  is  a curve  k^  which  is  in  best  "virtual"  agreement 
with  the  dashed  curve  in  the  sense  that  the  maximum  departure 
between  the  curves  is  minimized. 

An  example  will  clarify  the  situation.  A typical  set  of 
T 

eigenvalues  for  1/yAA  based  on  the  six  CO^  weighting  functions  of 
the  VTPR  sounder  is  given  by  {0.03,  1.10,  3.84,  11.01,  39.87, 

97.88  } . The  corresponding  eigenvalues  of  the  inverse  operator, 
in  Eq.  (19a),  are  shown  in  Fig.  2 as  circles  on  the  dashed  curve. 
Notice  that  the  eigenvalues  are  distributed  over  the  entire  extent 
of  the  x-axis.  This  can  always  be  made  to  occur  because  the  con- 
vergence factor  a is  chosen  in  such  a way  that  (a/y)  ||  AA*^  ||  < 1. 
Since  AA  I is  equal  to  the  largest  eigenvalue  of  AA  , picking 

a to  be  close  to  y/||  AA  ||  guarantees  that  the  largest  eigenvalue 
T 

of  1/yAA  is  near  f^  (x)  = 0,  that  is,  near  x = 1/a  in  Fig.  2. 

On  the  other  hand,  the  smallest  eigenvalue  will  always  be 

T 

close  to  zero,  because  the  matrix  AA  is  ill-conditioned  with 

OrO/ 

respect  to  matrix  inversion,  that  is,  the  ratio" between  the  largest 
and  smallest  eigenvalue  (the  P-qondition  number)  is  in  considerable 
excess  of  the  value  100.  Of  course,  it  was  this  ill-conditioning 
that  forced  us  to  use  regularization  methods  in  the  first  place. 

T 

The  smallest  eigenvalue  of  1/yAA  is  the  important  one, 
because  it  corresponds  to  the  largest  one  in  the  inverse  operator 
in  Eq.  (19a) . In  fact,  since  the  smallest  eigenvalue  is  almost 
zero,  the  corresponding  one  in  the  inverse  matrix  will  be  almost 
one.  It  is  clear  from  Fig-  2 that  the  approximate  operator 
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Eq.  (19b)  will  also  have  its  largest  eigenvalue  near  unity,  and, 
in  fact,  the  curves  h(x)  and  fj^(x)  have  their  best  agreement  there. 
This  is  very  encouraging  because  the  norm  of  the  operators  in 
Eqs.  (19a)  and  (19b)  is  equal  to  their  largest  eigenvalues. 
Therefore,  the  two  operators  will  agree  in  the  most  important 
sense — their  norms  will  be  virtually  the  same. 

We  know  from  the  theory  of  empirical  orthogonal  functions, 
see  Ref.  4,  that  the  structure  of  any  positive  definite  matrix,  such 
as  in  Eqs-  (19a)  or  (19b) , is  determined  primarily  by  its  largest 
eigenvalue,  and  decreasingly  so  with  decreasing  eigenvalues;  the 
smallest  one  contributes  the  least.  It  is  clear  from  Fig.  2 that 
the  larger  eigenvalues  of  the  matrices,  in  Eqs.  (19a)  and  (19b), 
match  up  the  best  and  that  the  intermediate  ones  match  up  to  a 
lesser  extent.  The  smaller  ones  compare  favorably  again,  but  we 
just  noted  that  they  are  not  very  important. 

It  is  now  clear  that,  graphically  at  least,  a case  can  be 
made  for  the  existence  of  virtual  duality  between  solutions,  in 
Eqs.  (11)  and  (8)  . (A  more  precise  argiiment  is  developed  in  a sub- 
sequent section.)  In  other  words,  there  exists  a such  that  the 
curve  f]^(z)  is  very  similar  to  the  curve  h(z)  . That  says  that 
the  eigenvalues  of  the  matrices,  in  Eqs.  (19a)  and  (19b),  are  almost 
the  same;  consequently,  the  matrices  themselves  are  almost  the 
same.  This,  in  turn,  says  that  Eqs.  (15)  and  (8)  give  virtually 
the  same  solutions  which,  finally,  establishes  the  duality  between 
solutions  in  Eqs.  (11)  and  (8). 

C.  Convergence  Properties 

Figure  2 is  useful  also  in  analyzing  convergence  properties, 
but  first  we  simplify  the  interpretation  of  the  figure  by  letting 
t = az.  Then,  t,  h,  and  all  the  vary  between  0 and  1,  and 

Eqs.  (20a)  and  (20b)  become,  respectively. 
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fj^(t)  = (1  - t)^  (21a) 

and 

h(t)  = (1  + t/a)"^  (21b) 

Now  imagine  that  the  x-axis  in  Fig.  2 has  been  replaced  by  a 
t-axis  that  varies  between  0 and  1.  This  allows  us  to  answer  con- 
vergence questions  very  easily,  because  now  the  family  of  f -curves 

jc 

remains  fixed  for  variable  a and  the  single  dashed  curve  moves 
instead. 

The  first  question  is:  What  happens  when  one  chooses  a 

smaller  a?  From  Eq.  (21b) , it  is  clear  that  a smaller  a produces 
a smaller  value  of  h.  Thus,  the  elbow  of  the  dashed  curve  in 
Fig.  2 will  move  even  further  into  the  lower  left-hand  corner  of 
the  figure  and  become  more  L-shaped;  This  results  in  better  agree- 
ment with  one  of  the  f^^-curves,  but  a greater  number  of  iterations 
are  required  for  convergence.  Therefore,  the  smaller  the  choice 
of  a,  the  truer  the  duality,  but  the  larger  the  number  of  itera- 
tions that  are  required. 

The  converse  of  the  foregoing  is  that  as  a increases,  the 
number  of  iterations  decreases,  but  the  duality  between  solutions, 
in  Eqs.  (11)  and  (8) , degrades-  If  the  value  of  a is  made  too 
large,  it  may  exceed  y/||  AA*^  ||  , in  which  case  Eq.  (11)  diverges. 
What  is  the  graphical  interpretation  of  a divergent  solution  in 
Fig.  2?  In  this  case,  the  dashed  curve  will  lie  entirely  above 
the  curve  labeled  "k  = 1,"  except  at  the  point  (0,1).  Under  these 
circumstances,  no  amount  of  iteration  will  yield  a solution — not 
even  the  first  iteration.  In  fact,  the  more  the  solution  is  iter- 
ated, the  further  the  solution,  in  Eq.  (11) , will  move  from  the 
correct  solution  in  Eq.  (8) - 

If  we  have  two  initial  approximations  of  different  accuracy, 
will  the  more  accurate  approximation  require  fewer  iterations  than 
the  less  accurate  one?  A better  initial  approximation  means  that 
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Y in  Eqs.  (8)  and  (11)  must  be  increased.  This  implies  that  a 
can  be  increased,  since  a < y/||  II-  Hence,  our  question  is 
answered  in  the  affirmative.  More  is  said  about  the  initial 
approximation  in  a subsequent  section. 

Finally,  Fig.  2 clearly  illustrates  the  important  fact  that 
the  iterative  analogue  of  regularization  is  to  terminate  the 
iterations  after  a finite  number  of  steps.  The  dashed  curve  in 
the  figure  lies  completely  to  the  right  of  the  solid  curve  labeled 
"k  = 128".  Therefore,  if  one  were  to  iterate  to  convergence,  the 
agreement  between  h(z)  and  would  be  quite  poor.  Clearly, 

the  optimum  number  of  successive  approximations  is  finite — being 
fewer-  than  128  when  a = 0.01. 


D.  Proof  that  the  Error  Norm  is  Satisfied 

All  that  remains  to  be  proved  is  that  index  k is  such  that 
k k ^ 

II  ^ ^ ” y II  - II  11  / where  x ^ is  given  by  Eq.  (11)  , or  equiva- 
lently, Eq.  (14)  . To  prove  this,  multiply  Eq.  (14)  by  the  matrix 
A and  apply  the  identity,  in  Eq.  (6) , to  the  resulting  equation, 
that  is. 

Ax  ^ - y = (I  - a/yAA  ) ^ (Ax^  - y) 

-1 

- (I  + 1/yAA^)  (Ax:°  - y)  (22) 

^n 


where  the  relationship,  in  Eq.  (16) , was  used  for 


On  the  other  hand,  multiplication  of  Eq.  (8) 
and  application  of  the  identity  in  Eq.  (17)  yield 


Ax 


^ . o r 

k = Rx  + 

'v,-  L 


T -1 

I - y(aa.  + yI  ) 

^n  ^\fn 


(y  - Ax'") 


the  second  line, 
by  the  matrix  A 


which  implies  that 

-1 

Ak  - y = (I  + l/yAA*^)  (Ax^  - y)  = £ (23) 

r\j'^  - f\j^  f\/\j  f\j-^ 
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where  the  last  equality  follows  from  the  fact  that  x satisfies 

Eq.  (7) . By  comparing  Eqs.  (22)  and  (23) , we  see  that 

IIax  “y||  - Thus,  the  proof  of  virtual  duality  between 

^ ~ y ~ 

the  solutions  in  Eqs.  (8)  and  (11),  for  k = k^,  is  complete. 

IV.  CLOSENESS  OF  THE  DUAL  SOLUTIONS 


A.  Closeness  of  the  Eigenvalues 

In  the  previous  section,  we  investigated  the  closeness  of  the 

curves  h(z)  and  fv  (z)  geometrically.  Now  we  study  the  closeness 
•^o 

of  these  two  curves  analytically  to  get  a niomerical  estimate  of 

their  difference.  The  function  h(z)  is  fixed,  but  the  functions 

f^(z)  vary  with  k.  Thus,  the  first  task  is  to  find  that  k = k^ 

for  which  f,  (z)  is  closest  to  h(z).  It  is  impossible  to  know  k 
JC  o 

a priori,  but  an  acceptable  choice  for  k^  is  the  one  that  minimizes 
the  squared  differences  between  corresponding  eigenvalues  of  the 
matrices  in  Eq.  (16) . However,  the  closed  form  solution  of  this 
minimization  problem  for  arbitrary  eigenvalues  and  arbitrary  a is 
intractable.  As  an  alternative,  we  will  do  the  problem  analyti- 
cally, and  with  the  aid  of  several  approximations,  arrive  at  a 
reasonable  (but  not  optimal)  a priori  estimate  of  k^. 

Let  t = az,  then,  since  ^ t ^ 1,  an  analytic  measure  of  the 
closeness  between  h(z)  and  fj^(z)  of  Eqs.  (20a)  and  (20b)  is  given 
by  the  integral 

F(k)  = [ 1/(1  + t/a)  - (1  - j^dt  (24) 


To  find  the  k that  minimizes  this  difference,  w^^  solve  the 
equation  dF/dk  = 0,  which  implies  that 


(1  - t)"  ln(l  - t) 
1 + t/a 


dt  = - (2k  + 1) 


-2 


(25) 


If  we  could  evaluate  this  integral  in  closed  form  for  a and  k,  we 
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would  know  k as  a function  of  a.  This  cannot  be  done,  but  the 

k ”kt 

approximation  (1  - t)  ln(l  - t)  te  , valid  for  sufficiently 
large  k,  puts  the  integral  into  the  form 

te  ^^/(l  + t/a)  dt  - a/(2k)  (26) 

0 ^ 

In  evaluating  the  integral  in  Eq-  (26) , it  was  assumed  that  k was 

-k 

sufficiently  large  so  that  e - 0,  and  the  mean-value  theorem 
was  used  to  obtain  the  estimate 


■a  + 1 
a 


-kt 
e 

t 


dt 


rOt  + 1 


a + 9 


-kt 

e dt 


-ak 
e 

2ak 


for  0 < 0 < 1.  It  was  found  empirically  that  a reasonable  value 
for  0 is  a.  This  choice  is  based  on  the  observation  that  normally 
a < < 1,  and  so  in  the  interval  [a, a + 1]  the  function  1/t  con- 
tributes much  more  to  the  integrand  at  a than  at  a + 1. 

Ecfuations  (25)  and  (26)  can  now  be  combined  to  yield 

k = ^ [d  - 2a)  + (1  - 4a) 

But  since  a is  small,  the  approximation  (1  - x)^  - 1 - nx  is  valid, 
and  we  obtain  the  following  explicit  expression  for  a k that  makes 
the  difference,  in  Eq.  (24),  small,  but  does  not  necessarily 
minimize  it: 


The  formula  in  Eq.  (27)  is  an  acceptable  approximation  for  k^ 
whenever  a < 0.1. 

Recall  that  in  Fig.  2,  we  chose  a = 0.01,  in  which  case 
k^  = 49  by  the  formula  in  Eq.  (27) . Examination  of  Fig.  2.  con- 
firms that  the  curve  k = 49  is  a reasonable  estimate,  particularly 
for  matching  the  larger  eigenvalues  of  the  inverse  operators. 

We  can  now  go  back  to  Eq.  (24)  and  estimate  F(k^).  Using 
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the  approximation 


j-l  2k 

F(k  ) < + (1  - t)  ° dt, 

° -*0  (1  + t/a)^  ‘D 

we  find  by  Eq.  (27)  that 

F(k  ) 1 2a/(l  - a^)  (28) 

o 

The  importance  of  the  relationship,  in  Eq-  (28) , is  that  it  says 
that  the  integrated  difference  of  Eq.  (24)  approaches  0 as  a ->■  0. 

In  other  words,  h(z)  and  fj^(z)  can  be  made  arbitrarily  close  by 
making  a sufficiencly  small.  Thus,  the  smaller  a is  chosen,  the 
closer  the  virtual  duality  is  to  true  duality. 

B.  Closeness  of  the  Twomey-Phillips  Solution  to  Its  Dual 

In  the  previous  section,  we  studied  the  closeness  of  the 
eigenvalues  of  the  dual  matrices;  now,  we  examine  the  closeness 
of  the  solutions  themselves.  Shown  in  Fig.  3 are  the  rms  dif- 
ferences between  the  Twomey-Phillips  inverse  matrix  solution  and 
its  iterative  dual  for  two  different  initial  approximations-  The 
dashed  curve  represents  the  rms  differences  when  forecasts  are  used 
as  the  initial  profile,  and  the  solid  curve  represents  the  rms 
differences  when  average  climatological  initial  profiles  are  used. 

The  structure  of  Fig.  3 is  the  same  as  that  of  Fig.  1,  except 
that  the  abscissa  is  in  tenths  of  degrees  instead  of  degrees.  The 
data  used  is  also  the  same  as  that  described  for  Fig.  1,  that  is, 
the  same  139  profiles  and  corresponding  simulated  radiances  are 
used. 

The  results  in  Fig.  3 are  as  expected;  the  difference  between 
the  two  curves  is  negligible.  This  indicates  that  the  dual  solu- 
tions agree  with  one  another  irrespective  of  the  initial  profile 
used,  as' long  as  the  same  one  is  used  by  both  inversion  methods. 
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On  the  other  hand,  the  accuracy  of  each  of  the  two  curves,  although 
reasonable,  is  not  quite  as  good  as  expected.  There  are  three 
reasons  for  this. 

First,  the  y used  in  the  inverse  matrix  Twomey-Phillips 
solution  was  the  same  value  for  all  139  cases.  This  prevented 
some  of  the  solutions  from  satisfying  the  requirement  that 
il  M - y II  - ||e||.  The  scalor  y should  have  been  adjusted  in  each 
solution  relative  to  the  magnitude  of  the  difference  ||  ” Y ||  • 

However,  we  do  not  have  a reasonable  a priori  method  for  making 
this  adjustment,  and  so  a constant  average  value  for  y was  used. 

Second,  the  number  of  iterations  for  the  iterative  dual  was 
limited  to  100  which,  according  to  Eq.  (27),  is  twice  the  number 
needed  for  our  choice  of  a = .01.  Nevertheless,  a few  of  the 
cases  did  not  converge  in  the  sense  that  ||  ” V II  " II  ^ II  • 

Again,  the  problem  stems  directly  from  the  choice  of  y,  which, 
in  turn,  affects  the  convergence  factor  a,  since  a < y/||  AA^  ||  . 

Third,  the  a used  was  always  about  the  largest  value  per- 
missible in  order  to  keep  the  number  of  iterations  low.  However, 
we  saw  in  the  previous  section  that  by  decreasing  the  value  of  a 
and  increasing  the  number  of  iterations,  we  could  have  improved 
the  accuracy. 

The  overall  accuracy  in  Fig.  3 is  roughly  between  0.3  and 
0.4^C,  except  at  the  tropopause  and  at  the  surface.  At  these  two 
levels,  there  is  less  agreement  between  the  dual  solutions,  appar- 
ently due  to  the  usual  problem  of  obtaining  accurate  solutions 
there  by  any  method. 

V.  EQUATIONS  OF  A MORE  GENERAL  FORM 
A.  The  Dual  Equations 

Details  of  the  principle  of  virtual  duality  were  worked  out 
in  the  previous  sections  for  the  Phillips-Twomey  inversion  method 
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as  an  outgrowth  of  the  duality  principle  for  the  least  squares 
solutions.  However,  we  need  not  stop  there.  The  principle  of 
virtual  duality  can  be  generalized  to  include  most  of  the  better 
known  regularized  linear  inversion  methods. 

The  most  general  of  the  regularized  inverse  matrix  solutions 
of  Eq.  (7)  that  we  will  consider  has  the  form 


X = x^  + Va"^(AVa"^  + W)  ^ (y  “ Ax^)  (29) 

where  V and  W are  arbitrary  symmetric  positive  -definite  matrices 
of  dimensions  m x m and  n x n,  respectively,  subject  to  the  con- 
straint II  ^ " Y II  ” 11^  II  • Hence,  V and  W have  inverses. 

The  iterative  dual  of  Eq.  (29)  is  of  the  form 

^ + aVA\“^  (y  - Ax^  " (30) 

which  is  obtained  in  the  same  way  that  Eq.  (11)  was  derived  from 
Eq.  (8) , except  that  the  identity,  in  Eq.  (9) , must  be  replaced 
by  the  more  general  identity 


T T -1  T -1  -1  ^ T -1 

^ W A + y ) AW 


Or  ^ 


(31) 


The  dual  Eq.  (30)  converges  provided  a < 2/||  W A ||  . As  before, 
we  limit  the  number  of  iterations  to  k = k , where  k is  such  that 


II  - y 

would  have 


If  we  were  to  iterate  to  convergence,  we 


lim 

k oo 


k 

X 


T -1  T -1  T'  T ““ 

VA  W (AVA  W )y  = VA  (AVA  )y 

f\j  o/vo;  Of  ~ 'Xj'Xj  aA/0»  ~ 


(32) 


which  is  the  weighted  least  squares  solution  with  weighting  matrix 

V. 

Note  that  Eqs.  (29)  and  (30)  are  simply  generalizations  of 

T 

Eqs.  (8)  and  (11) , respectively,  in  which  1/yA  has  been  replaced 
T -1 

by  VA  W . Consequently,  all  the  previous  discussion  relating 

to  Eqs.  (12)  through  (28)  holds  for  the  more  general  situation 

T T —1 

in  which  1/yA  is  replaced  everywhere  by  VA  W • In  particular. 
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the  key  substitution,  in  Eq.  (16)  , is  replaced  by  the  substitution 

(33) 


T -1 

(I  + AVA  W ) 
^n  O; 


T — 1 k 

for  (I  - aAVA  W ) , and  the  eigenvalue  analysis  is  applied  to 

•p  'Xj  ^ 

instead  of  ^ . Therefore,  the  eigenvalues  are  different, 
as  is  the  definition  of  z in  Eqs,  (20a)  and  (20b)  . However,  none 
of  the  analyses  associated  with  Fig.  2 change.  Only  the  locations 
of  the  circles  on  the  dashed  curve  in  Fig.  2 and  the  scale  of  the 
abscissa  must  be  altered.  Thus,  we  see  that  virtual  duality  holds 
between  Eqs.  (29)  and  (30)  as  well. 


We  are  now  ready  to  reap  the  benefits  of  all  our  work  to  this 
point.  Establishment  of  the  principle  of  virtual  duality  for  the 
general  Eqs.  (29)  and  (30)  permits  us  to  examine  the  various  linear 
inverse  matrix  and  iterative  inversion  methods  and  immediately 
write  down  their  respective  duals.  Furthermore,  knowledge  of  one 
of  the  duals  implies  immediate  knowledge  of  the  other  without  fur- 
ther investigation. 


B-  Special  Cases 

In  this  section,  we  look  at  the  more  familiar  linear  inver- 
sion methods  in  the  context  of  the  duality  principle.  Included  in 
the  matrix  inverse  methods  are  those  by  Twomey  and  Phillips  (Refs. 
1 and  2),  Twomey  H-matrix  types  (Ref's.  2 and  5),  Crone  (Ref.  6), 
and  Rodgers,  Strand  and  Westwater  (Refs.  1,  7,  8 and  9).  Table  1 
indicates  the  particular  forms  that  V and  W have  for  the  special 
cases  just  cited.  (Note  that  the  identity,  in  .Eq.  (31),  is 
required  to  reconcile  Table  1 with  the  expressions  in  the  original 
publications. ) 

Since  the  methods  listed  in  Table  1 are  all  inverse  matrix 
methods,  their  iterative  duals  are  of  particular  interest.-  For 
example,  the  Rodgers-Strand-Westwater  case  by  Eq.  (30)  becomes 
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TABLE  1 


Identification  of  Inverse  Matrix  Methods 
in  Terms  of  the  V and  W Matrices 


Method 

V 

w 

Twomey-Phil 1 ips 

^m 

Twomey  H-matrix 

1/yh-i 

Crone 

1/ylm 

(AAT)-I 

Rodgers-Strand-Westwater^ 

[« 

^ 1 

i 

^ is  the  profile  covariance  matrix,  eind 
^ is  the  noise  covariance  matrix. 


T -1 
aRA  E 


(y  - Ax 
- 


k - 1, 


(34) 


which  is  a form  that  apparently  has  never  appeared  in  the  literature. 
This  is  true  as  well  of  the  iterative  Twomey  H-matrix  and  itera- 
tive Crone  methods. 


Among  the  iterative  methods,  we  have  those  by  Landweber  (Ref. 
10),  W.  L.  Smith  (Refs.  1 and  11),  Conrath-Revah  (Ref.  12),  and 
Strand  (Ref.  10).  These  are  summarized  in  Table  2 in  the  same 
format  as  Table  1. 


Of  particular  interest  are  the  inverse  matrix  duals  of  the 
iterative  methods  of  Table  2.  For  example,  by  Eq.  (29)  and  the 
identity,  in  Eq.  (31) , the  Smith  and  Conrath-Revah  cases  can  be 
written  in  the  apparently  new  form 

which  is  a Twomey  H-matrix  type  solution  with  H equal  to  a diag- 
onal  matrix. 


Listings  and  categorizations  similar  to  those  of  Tables  1 and 
2 could  be  made  for  squaire  inversion  systems,  such  as  the  linear- 
ized .Chahine  (Ref.  13),  Chiu  (Ref.  14),  expansion  in  bases  func- 
tions (Refs.  8,  9,  and  15),  and  empirical  orthogonal  functions 
(Ref.  4).  However,  the  theory  and  duality  procedures  for  square 
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TABLE  2 


Identification  of  Iterative  Methods  in 
Terms  of  the  V and  W Matrices 


Method 

V 

w 

Landweber 

Im 

In 

W.  L.  Smith  (linearized) ^ 

D“1 

In 

Conrath-Revah^ 

G-1 

In 

Strand 

(aTa  + ylm)"^ 

In 

‘d  ■ ■’^=5  (A^n)  I I = (1,1 1,’' 

= diag  (A^Al^,^)  j ^ “ 


systems  are  essentially  special  cases  of  the  rectangular  systems 
just  described;  therefore,  there  is  no  need  for  further  illus- 
tration. 


VI.  THE  INITIAL  APPROXIMATION 


A.  Theory 


At  the  beginning  of  this  paper,  where  the  iterative  dual  of 
the  least  squares  solution  was  discussed  (Eq.  (4)),  we  saw  that  in 

V 

the  limit,  as  k ^ the  temff  containing  the  initial  approximation 
vanished.  This  is  a case  in  which  the  solution  is  independent 
of  the  initial  approximation.  Unfortunately this  is  not  true  for 
any  of  the  regularized  solutions  discussed  in  this  paper. 


To  see  just  how  the  solutions  depend  upon  the  initial  approxi- 
mation, let  X represent  the  solution  of  any  of.  the  inversion 
methods  of  this  paper  when  no  initial  approximation  is  used,  that 
is,  when  x^  = 0.  Then,  Eq.  (29)  can  be  written 


X = X + 


T T 

I - VA  (AVA  + W)  A 

Opm  Or  Of 


(36) 


T — 1 T 

and  Eq.  (14) , with  the  substitution  VA  W for  1/yA  , can  be 

^\/\j  f\j  ^ 
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I 


written 


k 

X 


o 


= X + 


(I  - aVA  W A) 

Ofin  OA/  ^ 


(37) 


Application  of  the  identity 


m m — 1 T — 1 

I - VA  (AVA  + W)  A = (I  + VA  W A) 

A(A»A^  A.  Oj  A>IH  ApA;  A A 

to  Eq.  (36)  permits  us  to  write  it  in  a form  similar  to  Eq. 
namely 


(38) 
(37)  , 


X = 3C  + 


(I 

aJU 


T “1 

+ VA  W A) 

AA  A A 


-1 


X 


(39) 


Since  ^he  n x m (n  < m)  rectangular  matrix  ^ has  rank  n,  the  rank 
of  Va\”^^  is  also  n.  Consequently,  ^ ~ terms 

in  Eqs.  (37)  and  (39)  containing  the  factor  x^  cannot  vanish. 
Hence,  the  dependence  of  the  solutions,  in  Eqs.  (37)  and  (39)  on 
is  a permanent  and  significant  feature  of  regularized  solutions 
for  rectangular  systems.  This  is  the  price  one  must  pay  for 
stability. 

If  the  matrix  ^ ^ ^ is  diagonalized  by  the  m x m orthogonal 

matrix  P,  then  the  matrices  in  Eqs.  (37)  and  (39)  also  can  be 

A 

diagonalized  by  P,  that  is 

(I  - aVA\~^A)^°  = P(I  - aA)^°  p"^  (40a) 

A^ni  AA  A A A APOCI  A A 


and 


(I 

Am 


T 

VA  W A) 

AA  A A 


= P(I 

A Am 


-1  T 
A)  ^P 

A A 


(40b) 


The  functions  of  eigenvalues  in  Eqs.  (40a)  and  (40b)  are  again  of 
the  form  in  Eqs.  (20a)  and  (20b) , respectively.  Thus,  not  only  do 
x^^  and  k of  Eqs.  (37)  and  (39)  depend  on  x^,  they  depend  on  x^  in 
virtually  the  same  way.  This  result  is  certainly  not  unexpected, 
considering  the  origins  of  Eqs.  (37)  and  (39) . 
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Fig.  4.  Profiles  of  the  rms  errors  between  the  139  radio- 
sondes and  the  Twomey-Phillips  inverse  matrix  solutions  when  fore- 
casts (dashed  curve)  and  climatology  (solid  curve)  are  used  as  the 
initial  profiles . (1  bar  = 100  kPa.) 
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B.  Results 


Examples  of  the  dependence  of  the  solution  on  the  initial 
approximation  are  afforded  by  the  Twomey-Phillips  inverse  matrix 
solutions  as  shown  in  Fig.  4.  As  in  Fig.  1/  the  curves  are  rms 
differences  between  139  retrieved  temperaiiire  profiles  using  simu- 
lated radiances,  sind  the  corresponding  radiosonde  profiles  (used 
as  truth).  The  dashed  curve  in  Fig.  4 represents  the  rms  errors, 
at  the  15  mandatory  pressure  levels,  when  forecasts  are  used  for 
the  initial  profile,  while  the  solid  curve  represents  the  rms 
errors  when  climatology  is  used  for  the  initial  profile. 

As  expected,  the  retrievals  based  on  the  forecasts  are  more 
accurate  than  those  based  on  climatology,  except  at  70,  100,  and 
150  mb.  The  pattern  reversal  at  these  three  levels  is  due  to  23 
profiles  between  0^  and  18^N  that  have  much  better  climatological 
initial  profiles  than  forecast  initial  profiles  in  the  vicinity  of 
the  tropopause.  This  anomaly  occurred  because  the  numerical  fore- 
cast model  applies  only  north  of  18^  latitude.  Below  that  latitude 
a single  average  initial  profile  is  used  in  lieu  of  a forecast, 
and  this  profile  just  happens  to  be  worse  than  the  climatological 
one  in  the  tropical  tropopause  region. 

VII.  CONCLUDING  REMARKS 

True  duality  exists  for  the  least  squares  solution,  but  the 
solution  is  unstable.  Therefore,  regularization  methods  must  be 
used  to  achieve  stability.  The  price  one  pays  for  this  stability 
is  the  dependence  of  the  solution  on  the  initial  approximation. 

Virtual  duality  has  been  established  for  a very  general  class 
of  linear  regularized  solutions.  The  regularization  process  for 
the  iterative  solutions  is  accomplished  by  terminating  the  iter- 
ation process  when  the  norm  of  the  residual  errors  is  equal  to  the 
norm  of  the  errors  in  the  radiances. 
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The  convergence  process  for  the  iterative  duals  can  be  demon- 

strated  graphically.  On  the  other  hand,  an  estimate  was  derived 

for  the  terminal  iteration  index  k associated  with  any  given  con- 

o 

vergence  factor  a.  For  a given  the  closeness  of  the  eigenvalues 
of  the  dual  solutions  can  be  determined  from  the  pertinent  equa- 
tions. The  concept  of  "virtual  duality"  is  meaningful  because  by 
reducing  the  size  of  a,  the  eigenvalues  of  the  dual*  operators  can 
be  made  arbitrarily  close. 

Explicit  formulas  for  the  inverse  matrix  solutions  and  their 
iterative  duals  are  given  for  the  more  common  of  the  regularized 
linear  inversion  methods.  These  include  the  following  methods: 
Twomey- Phillips,  Twomey  H-matrix,  Crone,  Rodger s-Strand- 
Westwater,  Landweber,  W.  L.  Smith,  Conrath-Revah,  and  Strand. 

This  list  is  by  no  means  complete.  In  addition,  apparently  new, 
inversion  methods  have  arisen  as  duals  of  some  of  the  known 
methods . 

Finally,  the  whole  point  in  developing . the  principle  of  vi- 
tual  duality  is  that  knowledge  of  one  of  the  duals  immediately 
implies  knowledge  of  the  other.  Thus,  the  study  of  inversion 
methods  is  more  unified  through  the  link  of  duality. 


B 

a. 


D 


E 


G 


F(k) 


f,^(z) 

h(z) 


S^TMBOLS 

n X m matrix  of  weighting  functions  (Eqs.  (1)  and  (7)) 
arbitrary  square  matrix 
Euclidean  norm  of  square  matrix  B 
diag(A  1 ) 

n . 

= S 

T -> 

diag(A  Al  ) 
n 

Twomey  H-matrix 

measure  of  closeness  of  eigenvalue  functions  f^  and  h 'for 
arbitrary  k (Eq.  (24)) 

(1  - az)^  (Eq. (20a)) 

(1  + z)"^(Eq.  (20b)) 
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CD  ?co  p N e^<;  iXi  ^s:  c?<  ?a  ft  1-3  <?cn  c?h3 


m and  n denote  the  dimension 


eigenvectors 


eigenvectors 

arbitrary  m x m symmetrix  positive  definite  matrix 
(Eqs.  (20)  and  (30) ) 

arbitrary  n x n symmetric  positive  definite  matrix 
(Eqs-  (29)  and  (30)) 

m-dimensional  source  function  vector  (Eqs.  (1)  and  (7)) 
m-dimensional  initial  approximation  vector 
kth  successive  approximation  of  the  m-dimensional 
iterative  solution  vector  (Eqs.  (3),  (11)/  and  (30)) 

m-dimensional  inverse  matrix  solution  vector  (Eqs.  (2) ^ 
(8)/  and  (29)) 

m-dimensional  solution  vector  when  no  initial  approxi- 
mation is  used  (i.e./  x^  = 0)  (Eqs,  (36),  (37),  and  (39)) 

n-dimensional  vector  of  measurements  (Eqs.  (1)  and  (7)) 
eigenvalue 

scalar  convergence  (relaxation)  factor  (Eqs.  (3),  (11), 
and  (30) ) 

smoothing  (regularization)  parameter  (Eqs.  (8)  and  (11)) 

n-dimensional  error  vector  (Eq.  (7)) 

parameter  (0  < 9 < 1) 

diagonal  matrix  of  eigenvalues 

ith  largest  eigenvalue 

"virtually  equal  to" 


I identity  matrix;  subscript 

of  the  square  matrix 

k iteration  index 

k terminal  iteration  index 

o 

m X m orthogonal  matrix  of 
~ 

profile  covariance  matrix 

noise  covariance  matrix 
e 

transpose  of  a matrix 
aZ 

n X n orthogonal  matrix  of 
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DISCUSSIONS 


Staelin:  Have  you  considered  whether  you  can  convert  from  the 

virtual  duality  principle  to  a true  duality  principle  by  altering 
the  way  the  matrix  is  regularized?  You  employed  a constant  times 
the  identity  matrix. 

Fleming:  Yes. 

Staelin:  Suppose  one  regularized,  for  example,  by  using  different 

elements  on  the  diagonal.  In  other  words,  is  there  some  way  to 
make  it  a true  duality  rather  than  only  a virtual  duality? 

Fleming:  I couldn't  find  any.  The  difficulty  is  this:  in  any 

iterative  scheme  when  you  telescope  together  the  various  iterates, 
you  always  end  up  with  a matrix  that  is  raised  to  a power.  Of 
course,  its  dual  always  involves  an  inverse  matrix.  I just  don't 
see  any  way  in  which  a power  matrix  can  ever  be  made  exactly  equal 
to  an  inverse  for  a finite  number  of  iterations.  That  is  the  real 
heart  of  the  difficulty. 

Strand:  I wanted  to  say  that  I notice  the  version  that  you  con- 

sidered is  a scalar  a,  at  least  at  first. 

Fleming:  Yes. 

Strand:  And  this,  I believe,  you  can  obtain  without  using  different 

alphas  by  dividing  both  sides  of  the  equation  by  suitable  factors 
in  the  first  place.  I was  in  contact  with  Landweber  one  time  and 
he  told  me  that  I could  get  the  effect  of  my  D matrix  in  this  way. 
Well,  the  D that  I had  was  something  slightly  different  in  that 
my  a was  a matrix.  So,  instead  of  using  your  scalar  matrix,  one 
fudges  the  thing  part  way  by  getting  a matrix  that  does  part  of  the 
job  and  then  iterates  with  that  for  the  rest  of  it.  I would  be 
surprised  if  there  wasn't  the  same  kind  of  duality  for  that  situ- 
ation. In  fact,  in  the  case  when  D equals  Twomey's  matrix,  we  are 
able  to  find  a very  close  known  relationship  between  the  y that  you 
use,  say,  for  ten  iterations  and  the  y used  for  one  iteration, 
which  is  essentially  Twomey's  process  to  start  with.  So,  although 
you  didn't  demonstrate  this,  I would  certainly  think  that  the  same 
kind  of  duality  would  hold  for  a more  general  assignment  of  what 
I call  the  D matrix. 

Fleming:  Yes,  intuitively  I feel  it  is  true  for  the  linear  problem. 

That  might  be  another  way  to  approach  the  problem. 

Cerni : You  said  that  your  solution  must  always  depend  on  the 

initial  guess,  and  there  exists  no  alternative.  Certainly  you  can 
entirely  replace;  if  you  choose,  the  constraint  to  an  initial 
guess  with  just  a smoothing  constraint.  Then  there  will  be  no 
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initial  guess  dependence.  What  sort  of  results  do  you  get  with 
purely  a smooth  constraint  in  the  inversion? 

Fleming:  Getting  rid  of  the  initial  guess  is  equivalent  to  using 

a zero  vector  in  the  vector.  You  can  smooth  all  you  like,  but 
the  larger  you  make  your  smoothing  parameter  in  any  of  these 
formulations,  the  closer  you  drive  the  solution  toward  the  initial 
approximation.  So,  if  you  started  with  a zero  vector,  you  are  just 
going  to  force  the  answer  to  look  more  and  more  like  that  constant 
value,  which  is  certainly  smooth.  But  it  is  still  dependent  on 
the  initial  guess.  Do  you  disagree?  Did  I answer  your  question? 

Cerni:  No,  I mean  in  certain  instances  we  get  fine  results  with 

a smoothing  constraint  alone  by  reducing  the  high  frequency  com- 
ponents  of  the  solution-  With  no  initial  guess,  we  get  a very 
representative  solution . 

Fleming:  I don't  know  in  what  context  you  are  talking  about  this. 

First  of  all,  do  you  have  a linear  problem? 

Cerni:  Yes,  it  has  been  linearized. 

Fleming:  Normally,  what  kind  of  vectors  are  you  looking  for?  Are 

you  looking  for  something  that  is  close  to  zero? 

Cerni : No . 

Fleming:  The  chances  are  that  you  have  a stable  system  and  then 

the  regularized  solutions  are  not  necessary.  Did  you  look  at  the 
conditioning  of  the  matrix  you  are  inverting?  If  it  is  well- 
conditioned  you  don't  need  a regularization  method.  In  that  case, 
you  should  use  the  least  squares  solution.  We  saw  that  the  least 
squares  solution  is  independent  of  initial  approximation-  So  I 
think  what  we  are  saying  is  that  your  problem  is  in  a different 
context  than  mine  is.  [Added  in  proof:  In  retrospect,  it  appears 
that  Dr.  Cerni  always  uses  the  same  first  guess  (the  implied  zero 
vector);  therefore,  the  question  of  the  dependence  of  the  solution 
on  the  initial  guess  is  not  germane.] 

Herman:  When  you  say  that  the  final  solution  is  dependent  on  the 

initial  guess,  what  degree  of  dependence  do  you  mean?  I will  be 
showing  this  afternoon,  and  earlier  in  the  week,  some  of  the 
Arizona  group  have  also  shown  that  inversions  of  real  data  starting 
with  widely  varying  first  guesses  almost  overlap  each  other.  Now 
this  was  done  using  an  H matrix  and  not  the  other  technique  you 
were  talking  about,  which  you  said  does  not  fit  in  this  category, 
but,  nevertheless,  it  is  relatively  insensitive  to  the  initial 
guess  and  is  an  ill-conditioned  matrix. 
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Fleming:  I am  simply  saying  that  the  better  the  initial  approxi- 

mation, the  better  reliance  you  can  have  on  the  solution,  resulting 
in  nairrower  error  bars  around  your  solution.  I am  not  saying  it 
is  dependent  in  a particular  way;  I am  saying  there  is  a strong 
dependence  which  differs. with  each  different  kind  of  initial 
approximation  and  with  each  different  method  of  solution. 

Rodgers:  One  would  hope  that  the  solution  would  depend  on  the 

initial  guess  because  the  initial  guess  is  a virtual  measurement 
and  the  solution  also  depends  on  the  measurements.  That  wasn't 
what  I really  wanted  to  ask.  I feel  sure  that  some  kind  of  linear 
transformation  can  turn  the  more  general  linear  method  (where  you 
have  two  covariance  matrices)  into  a form  with  a dual  by  suitable 
eigenvector  transformation. 

Fleming:  I feel  it  can,  but  I just  don't  know  how. 

Rodgers:  Well,  you  can  transform  the  covariance  matrix  to  a 

diagonal  form  or  a unit  form. 

Fleming:  Yes,  but  even  if  they  are  diagonal,  they  don't  commute. 

You  can't  move  the  diagonal  from  one  side  of  the  matrix  to  the 
other  side.  This  is  where  the  whole  thing  broke  down  on  me. 

Rodgers:  You  can  turn  a covariance  matrix  into  a unit  matrix  by 

a suitable  scale  transformation  and  rotation.  That's  what  you 
need,  isn't  it? 

Fleming:  But  then  will  it  be  the  same  matrix?  Will  it  hold  for 

A^A  as  well  as  the  transformed  matrix? 

Rodgers:  Well,  it  will  give  you  a different  A.  If  you  apply  a 
suitable  transformation  to  X,  such  that  what  is  equivalent  to  the 
covariance  of  X turns  into  unit  matrix,  then  A is  going  to  change 
to  something  else.  And  then  I think  you  end  up  with  a form  that 
has  a dual. 

Fleming:  If  you  can  do  that,  my  theory  applies  and  one  has 

duality.  [Added  in  proof:  In  the  final  text  of  the  paper,  this 
difficulty  has  been  overcome,  and  the  virtual  duality  principle 
is  shown  to  hold  for  all  of  the  better  known  linear  inversion 
methods . ] 

Twitty:  This  is  just  a comment.  You  told  Dr.  Cerni  that  it 

sounded  like  his  solution  might  fall  in  the  category  of  having  a 
stable  solution,  because  the  inverse  of  the  matrix  really  exists. 

In  most  of  the  cases  we  have,  that  is  not  true.  We  have  an 
unstable  matrix.  There  must  be  something  that  goes  continuously 
from  the  case  where  you  have  to  regularize  it  and  where  you  don't. 
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There  must  be  some  measure,  or  scale,  in  this  analysis  that  could 
measure  how  stable  a situation  you  have. 

Fleming:  Yes,  and  that  is  usually  the  so-called  P-condition  number 

of  A,  which  is  the  ratio  of  the  largest  to  the  smallest  eigenvalue 
of  A^A.  That  is  one  way  of  looking  at  it,  if  you  want  to  do  an 
eigenvalue  analysis  on  the  matrix. 

Twitty:  Yes,  but  perhaps  a measure  that  would  relate  to  something 

more  physical,  because  we  are  talking  about  whether  we  have  virtual 
data  or  not  virtual  data  in  the  solution. 

Fleming:  One  quick  thing  that  can  be  done  is  to  solve  the  problem 

in  the  best  way  you  know-  Then  perturb  the  input  vector  of 
measured  quantities  in  a systematic  and  realistic  way  and  solve  the 
problem  again.  If  the  new  solution  differs  from  the  original 
solution  in  a proportionate  and  physically  acceptable  way,  the 
problem  is  well-conditioned.  Otherwise,  some  kind  of  regialarization 
and/or  additional  virtual  data  are  needed. 

Twitty:  Yes,  I understand  you  can  take  a specific  problem  like 

that  and  determine  the  regime  you  are  in.  You  seem  to  have  been 
attacking  the  problem  for  getting  more  physical  insight  into  things. 

Westwater:  You  mentioned  that  the  duality  did  not  exist  when  you 

considered  the  second  or  first  derivative  constraint.  Have  you 
used  this  type  of  constraint  in  temperature  retrievals?  And  if 
you  have,  how  does  that  compare  with  the  standard  Twomey  minimi- 
zation around  an  initial  guess? 

Fleming:  They  are  generally  very  close.  The  nice  thing  about  the 

smoothing  techniques  is  that  the  answer  is  relatively  insensitive 
to  the  kind  of  smoother  you  use.  The  type  of  constraint  used  is 
critical  only  in  those  regions  where  the  weighting  functions  are 
close  to  zero.  Let  me  give  you  an  example.  We  were  looking  at 
some  stratospheric  sounding  problems  at  one  time  which  involved 
the  so-called  "sudden  warming."  If  you  use  a sounder  such  as  the 
VTPR  for  which  most  of  the  weighting  functions  peak  the  tropo- 
sphere, there  is  little  information  about  the  stratosphere.  So  if 
you  have  a sudden  warming , situation,  the  radiances  corresponding 
to  the  stratosphere  increase  drastically.  Consequently,  when  you 
retrieve  a temperature  profile,  all  that  excess  energy  is  dumped 
into  the  lower  part  of  the  atmosphere  where  the  weighting  functions 
peak,  and  you  lose  all  the  information  on  the  stratospheric  warming. 
After  experimenting  with  the  various  kinds  of  smoothers,  we  found 
that  we  could  restore  the  sudden  warming  portion  of  the  temperature 
profile,  which  is  above  the  weighting  function  peaks  by  using  the 
right  kind  of  smoother.  In  other  words,  the  smoothing  constraints 
provide  the  information  where  the  weighting  fiinctions  trail  off  to 
zero, but  have  little  effect  on  the  rest  of  the  solution.  This  is 
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completely  in  the  vein  of  Clive  Rodgers*  talk.  You  have  to  know 
a priori  the  kind  of  information  you  are  working  with  and  use  it 
in  the  correct  way..  All  this  is  as  much  an  art  as  it  is  a 
science. 

Twomey:  Henry,  I never  thought  I *d  hear  you  say  that! 
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INVERSION  OF  PASSIVE  MICROWAVE  REMOTE 


SENSING  DATA  FROM  SATELLITES 


David  H.  Staelin 

Massachusetts  Institute  of  Technology 


Global  passive  microwave  observations  from  Earth- 
orbiting satellites  have  mapped  humidity  and  liquid  water 
over  ocean f temperature  profiles ^ ice  and  snow,  and  other 
geophysical  parameters • Future  satellites  will  extend  the 
altitude  range  above  100  km  and  will  expand  the  list  of 
trace  constituents  that  can  be  monitored. 

In  most  applications , the  Inversion  problem  Is  ade- 
quately approximated  as  linear  with  jointly  Gaussian  sta- 
tistics, and,  thus,  a linear  retrieval  performs  well.  In 
some  cases,  the  problem  Is  typically  factored  Into  a 
decision  process  followed  by  appropriate  linear  or  quasl- 
llnear  processes . Certain  problems,  however,  require  more 
powerful  nonlinear  or  nonstationary  procedures,  such  as 
Kalman  filtering. 


I .  INTRODUCTION 

Passive  microwave  sensing  techniques  are  employed  in  geo- 
physics, radioastronomy,  biomedical  and  other  areas.  The  special 
value  of  these  techniques  results  primarily  from  the  facts  that 

1.  microwave  receivers  can  sense  thermal  radiation  with  one- 
second  sensitivities  as  high  as  10”^K. 

2 . microwave  radiation  penetrates  most  matter  much  more 
readily  than  does  infrared  radiation,  which  is  the  usual  alterna- 
tive technology;  and 

3.  many  gases  have  microwave  resonances  which  can  readily  be 
resolved,  and  which  permit  observation  even  in  some  cases  where 
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other  spectral  regions  are  inadequate.  Geophysical  applications 
have  been  reviewed  by  Staelin  (Ref.  1) , Tomiyasu  (Ref-  2) , Waters 
(Ref.  3),  and  others. 

One  important  property  of  microwave  thermal  radiation  results 

because  hv  <<  kT  for  most  cases  of  interest  (where  h is  Planck's 

constant,  v is  frequency,  k is  Boltzmann's  constant,  and  T is 

temperature) . For  example,  if  T - 100  K,  then  v must  be  much  less 

1 2 

than  2000  GHz  (2  x 10  Hz) . This  constraint  is  usually  met,  and 

then  Planck's  radiation  law  passes  to  the  Rayleigh- Jeans  limit, 

where  thermal  radiation  intensity  I is 

2kT  f —2  “1  “1^  y 

I ^ (W  m Hz  ster  ) (1) 

X 

(X  is  wavelength)  and  the  power  in  a single-mode  transmission  line 
is 

P = kT  (W  Hz~^)  (2) 

The  linear  relation  between  temperature  and  power  generally 
results  in  more  nearly  linear  retrieval  problems  when  data  are  to 
be  interpreted. 

It  also  follows  that  the  responsiveness  (AP/AT)  of  receivers 

is  not  degraded  for  low-temperature  sources.  Receiver  sensitivities 

At  (K)  typically  depend  on  radio-frequency  bandwidth  B (Hz) , 
rms 

integration  time  x (sec) , receiver  noise  temperature  T (K) , and 

R 

the  antenna  or  source  temperature  T^  (K) , as  follows : 


T + T 

at  (K)  - 2 - 

rms 


(3) 


The  source  temperature,  viewed  from  a terrestrial  satellite,  might 

be  '^^250  K,  and  T (below  200  GHz)  could  range  from  10  to  5,000  K, 

R 

with  1000  K being  typical.  B might  range  from  10^  to  10^  Hz,  with 

10®  being  typical;  therefore,  AT  for  1-sec  averaging  might  be 

rms 

0.01-10  K,  with  0.24  being  typical.  Note  that  the  receiver  sensi- 
tivity actually  improves  for  colder  targets.  Receiver  and  antenna 
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technology  for  remote  sensing  is  similar  to  that  for  radioastron- 
omy, which  has  been  reviewed  by  Kraus  (Ref-  4) , and  others. 

The  equation  of  radiative  transfer  appropriate  for  microwaves 

Q n Q 

(10  - 10  Hz)  in  the  terrestrial  atmosphere  is  usually  approxi- 

mated as  that  for  a slightly  lossy  medium  in  local  theimio- 
dynamic  equilibrium.  Scattering  from  tropospheric  inhomogeneities 
is  very  weak  and,  like  refractive  and  ionospheric  effects,  is 
usually  neglected  for  passive  microwave  remote  sensing  purposes. 
Exceptions  occur  primarily  for 

1.  tropospheric  rays  parallel  (±  ~ 1^)  to  the  geoid,  when 
refractive  effects  become  important;  and 

2-  scattering  from  cloud  droplets,  which  becomes  important 

only  when  A , where  n and  d are  the  refractive  index  and 

TTn 

diameter  of  the  drops,  respectively. 

Below  60  GHz,  this  situation  is  important  primarily  for  heavy  pre- 
cipitation. Even  then  the  absorption  of  those  same  clouds  can 
overwhelm  the  scattering  effects,  which  generally  remain  a modest 
perturbation  in  computations  of  cloud  brightness  temperatures. 
Microwave  scattering  from  clouds  has  been  discsused  in  many  places, 
such  as  Refs.  5 and  6.  Scattering  is  most  often  discussed  in  the 
context  of  radar,  because  then  even  a weak  scattered  signal  from 
a cloud  is  essentially  the  only  signal. 

The  usual  simplicity  of  microwave  radiative  transfer  also 
simplifies  data  interpretation.  The  equation  of  radiation  trans- 
fer can  usually  be  approximated  as 


-2t  -t 

T_  = RT  , e + (1  - 


B 


sky 

-T 


+ Re 


rT(r)„(V)e-f 


d£' 


T(i.)a(£)e 


a(£)d£ 


d£ 


(4) 
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where  is  the  brightness  temperature  viewed  from  a satellite. 

Jl  is  distance  along  a ray  emitted  upward  from  the  terrestrial  sur- 
face, and  £*  is  distance  along  a ray  incident  upon  the  surface; 

= £»  = 0 refers  to  the  surface,  and  £ = «>  refers  to  some  point 
above  the  atmosphere.  R is  the  surface  reflectivity,  and 

^surf  brightness  and  kinetic  temperatures  of  space  and 

the  terrestrial  surface,  respectively.  ot(£)  is  the  atmospheric 
absorption  coefficient  and  is  the  integral  of  a along  £ from 
0 to 


The  equation  of  radiative  transfer  is  thus  generally  domi- 
nated by  atmospheric  absorption,  thermal  emission  processes,  and 
surface  reflection  characteristics.  Surface  characteristics  vary 
considerably,  some  surfaces  being  largely  specular,  such  as  the 
ocean  (Ref.  7) , and  some  scattering  or  absorbing  strongly,  such 
as  snow  or  ice  (Ref.  8)  or  forested  land  (Ref.  9) . The  surface 
reflectivity  of  the  ocean  or  fresh  water  might  be  **^0.4  to  0.6, 
whereas  that  of  ice  and  snow  might  be  *«0-7  to  1.0,  and  that  of 
land  might  be  ^^0.8  to  1.0. 


The  dominant  microwave  absorbers  in  the  atmosphere  are  H^O, 

O , O , clouds,  and  precipitation.  Figure  1,  prepared  by 
^ ^ 1 . 

P.  W.  Rosenkranz,  illustrates  the  typical  opacity  of  the  ter- 
restrial atmosphere  due  to  water  vapor,  oxygen,  and  clouds.  Many 
typical  trace  constituents  also  have  weak  but  quite  observable 
spectra.  The  microwave  spectral  properties  of  the  atmosphere  have 
been  reviewed  by  Staelin  (Ref.  1) , Waters  (Ref.  3) , Rosenkranz 
(Ref.  10),  Liebe  (Ref.  11),  and  others.  With  the  recently  avail- 
able sensitive  receivers  in  the  100-300  GHz  region,  many  addi- 
tional microwave  spectral  lines  are  being  detected  in  the  atmos- 
phere for  the  first  time,  such  as  those  of  and  CO  near  100- 
200  GHz. 

Below  altitudes  of  75  km  pressure  broaderning  dominates  the 
width  of  most  microwave  molecular  resonances  and,  thus,  provides 
a means  for  determining  the  pressure  altitude  at  which 
^Personal  communication. 


364 


I U I 


1 II 


Fig.  I.  Sea  level  zenith  opacity  for  oxygen  In  the  U.S. 
Standard  Atmosphere  and  for  2 cm  of  preclpltable  water  vapor  dis- 
tributed exponentially  with  2 km  scale  height.  The  effects  of  a 
2-km  thick  stratus  cloud  are  also  shown.  Typical  measurements  for 
alone  and  plus  H^O  are  indicated  by  asterisks . (Figure 
supplied  by  Rosenkranz.) 

contributions  to  an  obseirved  spectral  line  originated.  Typical 
microwave  receivers  achieve  adequate  sensitivities  with  band- 
widths  of  ^1-100  MHZ/  which  are  generally  much  less  than  the 
observed  linewidths  of  2-2000  MHz.  This  ability  to  measure 
easily  the  precise  shapes  of  most  spectral  lines  contrasts  with 
the  situation  for  most  infrared  detectors,  which  typically  observe 
broad  bands  that  overlap  many  resonances.  Even  gaseous  selective 
absorption  infrared  filters  only  partially  limit  this  smoothing 
of  spectra  and  consequent  degradation  of  altitude  information. 

The  altitude  dependence  of  microwave  emission  differs  in 
another  way  from  that  of  infrared  sensors.  The  linear  relation 
between  temperature  and  radiated  microwave  power  differs  from  the 
strongly  nonlinear  temperature  dependence  of  the  Planck  function 
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in  the  thermal  infrared.  This  results  in  altitude  sensitivities 
of  microwave  spectra  that  are  only  modestly  dependent  on  atmos- 
pheric temperature  profiles,  whereas  these  temperatures  can  have  a 
profound  effect  on  spectra  in  the  thermal  infrared. 


II.  SOUNDING  OF  ATMOSPHERIC  TEMPERATURE  PROFILES 

Two  basic  geometries  are  appropriate  for  satellite  sensing  of 
atmospheric  temperature  profiles:  the  "nadir  mode"  wherein  the 
satellite  sensor  views  the  atmosphere  at  any  desired  angle  such 
that  the  ray  intercepts  the  terrestrial  surface,  and  the  "limb- 
scanning mode"  for  which  the  ray  does  not  intercept  the  earth  but 
passes  through  the  atmosphere  at  the  limb  of  the  planet.  Only 
nadir-mode  microwave  systems  have  been  built  to  date. 

At  present  only  two  satellites  carry  oassive  microwave 
sounders  for  temperature  profile  measurement:  Nimbus-5  carries 
Nimbus  E Microwave  Spectrometer  (NEMS) , launched  December  12,  1972 
(Ref.  12);  and  Nimbus-6  carries  Scanning  Microwave  Spectrometer 
(SCAMS) , launched  June  11,  1975  (Ref.  13) . They  each  have  two 
channels  near  1-cm  wavelength  for  observing  the  surface  and  atmos- 
pheric water,  and  three  channels  near  0.6-cm  wavelength  which  are 
on  the  opaque  wings  of  the  0.5-cm  oxygen  absorption  complex;  they 
measure  thermal  radiation  originating  from  different  atmospheric 
levels  (0-18  km) 

Temperature  profiles  may  be  determined  because  each  channel 

T (V)  responds  differently  to  the  temperature  profile  T(£)  as 
6 

characterized  by  standard  weighting  functions  W(v,£) : 


T«(v)  ^ T ^ + 
B const 


T(£)W(v,£)d£ 


(5) 


The  microwave  temperature  weighting  functions  presented  in  the 
literature  to  date  are  formed  by  simply  factoring  T(£)  from  the 
appropriate  terms  of  Eq.  (4)  to  produce  Eq.  (5) . Typical  weighting 
functions  are  presented  in  Fig.  2.  In  fact,  more  meaningful 
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Figr.  2.  Representative  atmospheric  temperature  weighting 
functions  for  frequencies  near  the  60-GHz  oxygen  band.  A nadir- 
viewing satellite  is  assumed.  A continuous  distribution  of  such 
weighting  functions  can  be  obtained  between  0 and  75  km. 


weighting  functions  could  be  found  that  are  less  temperature 
dependent.  If  one  expands  the  integrand  of  Eq.  (5)  in  a Taylor 
series  about  a presumed  temperature  profile  and  keeps  only  the 
first  order  term  T{l)VI^  (v,H)  in  the  integral,  then  the  resulting 
temperature  weighting  function  W (v,£)  is  usually  not  greatly  dif- 
ferent from  the  customarily  presented  W(v,5,)  because  of  the 
linearity  of  the  microwave  spectrum. 

These  weighting  functions  are  very  similar  to  those  in  the 
infrared  region.  The  microwave  functions  are  generally  broader 
in  the  lower  troposphere  where  the  lapse  rate  is  significant  and 
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the  Planck  function,  therefore,  sharpens  the  infrared  temperature 
weighting  functions  based  on  the  CO^  bands  near  15  ym  and  especially 
4.3  ym.  Near  the  tropopause  and  in  the  lower  stratosphere,  how- 
ever, the  Planck  function  severely  broadens  the  infrared  functions, 
and  the  low  spectral  resolution  of  present  infrared  sounders 
broadens  them  further  at  still  higher  altitudes. 

The  different  behavior  of  the  microwave  and  infrared  weighting 
functions  suggests  optimum  satellite  systems  will  incorporate  both 
spectral  regions,  e.g.,  the  narrow  4.3  ym  weighting  functions  for 
the  0-8  km  region,  and  the  narrower  microwave  functions  for  alti- 
tudes above  -8  km.  One  present  compensating  virtue  of  broadband 
infrared  sensors  is  that  their  greater  sensitivities  peinnit  shorter 
integration  times,  and,  therefore,  they  can  yield  higher  spatial 
resolution.  Thus,  there  can  be  complex  system  design  trade-offs 
between  the  ranges  of  vertical,  horizontal,  and  temperature  reso- 
lution that  various  microwave  and  infrared  sensors  can  provide. 

A second  important  property  of  microwave  temperature  sounders 
is  their  ability  to  penetrate  all  cirrus  clouds  and  almost  all  non- 
precipitating clouds.  A study  of  the  degradation  of  NEMS  tempera- 
ture retrievals  by  clouds  (Ref.  14)  shows  that  only  intense  pre- 
cipitation bands,  subh  as  the  Inter-Tropical  Convergence  Ozone 
(ITCZ)  or  major  extratropical  storms,  have  any  measurable  effect 
on  temperature  retrievals.  This  effect  arises  from  perturbations 
in  the  brightness  temperature  of  ^ -5  to  +2  K for  channels 
sensing  ^3-S  km  altitude.  These  effects  are  normally  negligible 
because  most  heavy  clouds  have  temperatures  close  to  those  of  the 
observed  brightness  temperatures,  and  because  most  heavy  clouds 
do  not  fill  the  entire  ~300-km  viewing  zone  of  NEMS.  Approximately 
0.4%  of  NEMS  obse2Tv^ations  for  the  4-km  weighting  function  were 
perturbed  more  than  1 K,  and  these  were  primarily  over  the  ITCZ 
(Ref.  14) . 

Because  the  radiative  transfer  problem  in  the  opaque  bands 
is  nearly  linear  in  temperature,  and  since  the  temperature 
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variations  are  nearly  a jointly  Gaussian  process,  a linear  least- 
square-error  retrieval  process  for  temperature  profiles  is  nearly 
optimum;  conventional  linear  regression  procedures  can  be  used. 
Analyses  of  the  accuracy  of  such  temperature  retrievals  for  NEMS 
were  performed  by  Waters,  et  al.  (Ref-  15)  and  Wilcox  and  Sanders 
(Ref.  16) . Typical  accuracies  are  presented  in  Fig.  3 (From  Ref. 
15)  . 

The  reported  inaccuracies  in  the  retrievals  arise  from 
(1)  receiver  noise  and  calibration  errors,  amplified  by  the 
retrieval  process;  (2)  errors  in  the  presumed  atmospheric  propa- 
gation characteristics;  (3)  retrieval  errors  resulting  from  the 
inability  of  coarse  weighting  functions  to  respond  to  fine  struc- 
ture in  the  true  temperature  profiles,  which  is  most  pronounced 
near  the  surface  and  the  tropopause ; and  (4)  errors  in  the  com- 
parison sources,  which  are  typically  single  or  smoothed  sets  of 
radiosondes.  Receiver  noise  for  NEMS  was  -0.15  K per  sounding. 
Calibration  errors  are  believed  to  be  less  than  2 K and  essentially 
constant;  therefore,  they  contribute  primarily  to  mean  retrieval 
errors  of  ± 2 K,  but  very  little  to  root-mean-square  (rms)  errors 
about  the  mean.  Errors  in  the  propagation  equations  are  believed 
to  be  small  and  to  produce  systematic  errors  largely  indistinguish- 
able from  calibration  errors.  Most  of  the  inferred  retrieval 
errors  are  believed  to  result  from  coarse  weighting  functions  and 
radiosonde  errors.  Evidence  for  this  hypothesis  follows  from  com- 
parison of  the  rms  and  mean  errors  between  the  observed  T^  values 

and  those  computed  theoretically  for  interpolations  of  the  corre- 
Z Z 

spending  0 and  12  National  Meteorological  Center  (NMC)  tempera- 
ture field  analyses;  these  rms  errors  about  the  mean  regression 
line  are  ^1,1  K,  0.7  K,  and  0.7  K for  channels  peaking  near  4 km, 

11  km,  and  18  km,  respectively.  Removing  the  -0.15  K contribution 
of  receiver  noise  implies  ims  NMC  errors  of  only  -0.7  K,  which  is 
not  implausible,  even  if  all  propagation  and  calibration  errors 
were  zero. 
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Fig,  3,  Discrepancies  (rms)  between  (1)  NEMS  temperature 
retrievals  (mean  error  deleted)  and  NMC  analyses  interpolated  in 
time  and  space  in  data-rich  regions , (2)  NEMS  retrievals  and  82 
coincident  Radiosonde  Observations  (RAOB*s)^  (3)  a statistically 
modeled  point-temperature  sensor  and  a hypothetical  area  sensor 
sampling  ^4  x 10^  km^  for  statistics  above  54  N latitude , and 
(4)  the  geometric  sum  of  (1)  and  (3),  (1  bar  = 100  kPa.) 

The  data  of  Fig.  3 reveals  that  much  more  error  was  deduced 

when  comparisons  were  made  with  individual  radiosondes  instead  of 

the  NMC  grid.  There  are  two  hypotheses  to  explain  this  difference 

The  first  explanation  is  that  the  excessive  errors  for  radiosondes 

are  due  to  aliasing  errors;  the  large  area  averaged  by  NEMS  is 

better  represented  by  the  NMC  analyses,  which  are  derived  by 

averaging  nearby  radiosondes.  This  hypothesis  has  been  tested  by 
2 

Ledsham,  who  computed  the  auto-correlation  functions  of  the 
2 

Personal  communication. 
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atmospheric  temperature  field' at  several  altitudes  and  latitudes, 
and  then  predicted  the  aliasing  errors  that  could  be  expected. 

These  errors  are  plotted  in  Fig.  3 as  the  area-point-sensor  error. 
This  result  is  consistent  with  the  observed  rms  discrepancies 
between  the  NMC  analyses  and  individual  radiosondes,  and  consistent 
with  the  discrepancies  between  the  NEMS  comparisons  with  NMC 
analyses  and  radiosondes,  all  shown  in  Fig.  3. 

The  second  hypothesis  (Ref.  17)  is  that  the  radiosondes  are 
more  representative  than  the  NMC  analyses,  which  smooth  the  fine 
vertical  structure  in  the  comparison  profiles  and,  thus,  yield 
artificially  smaller  rms  errors.  If  this  second  hypothesis  is 
correct,  then  there  may  be  an  unidentified  extra  source  of  error. 
Although  this  appears  unlikely,  a more  definitive  study  remains 
to  be  done. 

SCAMS  is  similar  to  NEMS  but  scans  left  to  right  in  13  1-sec 
steps  with  7.5^  beamwidths.  This  yields  maps  of  temperature  pro- 
files -2500  km  wide  and  with  -150-300  km  resolution.  Examples 
are  shown  in  Fig.  4. 

The  60-GHz  band  of  oxygen  is  not  the  only  opaque  region  that 
can  be  used  for  temperature  sounding;  the  isolated  118.75  GHz 
resonance  of  is  even  more  opaque  than  the  60-GHz  complex. 
Although  cloud  effects  can  be  perhaps  twice  as  great  near  118  GHz, 
the  effects  are  still  important  primarily  in  the  ITCZ  and  large 
extratropical  storms.  The  weighting  functions  are  slightly 
broader  at  118  GHz  than  near  60  GHz,  and  can  reach  to  altitudes 
near  80  km  at  the  very  center  of  the  resonance. 

Limb-scanning  observations  of  the  atmospheric  temperature 
profile  require  large  antennas  in  order  to  obtain  useful  angular 
resolution.  The  excellent  altitude  resolution  for  limb-scanning 
systems  arises  because  the  majority  of  the  emission  originates 
within  » 2 km  of  the  ray  tangent  height  if  the  atmosphere  is  not 
opaque.  If  the  satellite  is  1000  km  from  the  tangent  point  for  a 
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Fig,  4,  Representative  maps  produced  by  SCAMS,  on  3 and  5 
February  1976  * indicates  5 February  data  . (A)  latitude  and 

longitude  at  nadir  (image  center) , (B)  brightness  temperature  at 
31.6  GHz  (black  is  hot;  note  land  masses),  (C)  retrieved  water 
vapor  over  ocean  (black  is  humid)  , (D)  retrieved  liquid  water  over 
ocean  (black  is  heavy  clouds  or  precipitation) , (E)  retrieved 
average  air  temperatures  "F"  1000-500  mb  (note  uncompensated 
effects  of  high  elevation  land  over  Antarctica) ; contours  are  each 
2K  wide,  (F)  average  temperature  500-250  mb,  and  (G)  average 
temperature  250-100  mb. 


given  ray,  then  an  antenna  beamwidth  of^l  arc  minutes  is  required 
to  achieve  2-km  vertical  resolution.  This  requires  an  antenna 
~3  m long  in  the  vertical  direction  at  60  GHz,  or  -1.5  m at  118 
GHz.  These  dimensions  are  within  the  capability  of  the  space 
shuttle  and  modern  antennas - 

The  techniques  one  can  employ  at  microwave  frequencies  (Ref. 

18)  are  again  analogous  to  those  available  in  the  infrared  (Ref. 

19)  - Weighting  functions  available  near  125  GHz  are  illustrated 
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in  Fig-  5/  and  brightness  temperatures  as  a function  of  tangent 
height  are  illustrated  in  Fig.  6 for  two  different  model  atmos- 
pheres . 

One  of  the  problems  in  limb-scanning  is  determination  of  the 
absolute  pressure  altitude  of  the  scan-  Accuracies  better  than 
«200  m are  desirable.  Note  that  the  warmer  atmosphere  in  Fig.  6 
appears  hotter  at  those  view  angles  (or  tangent  heights)  corre- 
sponding to  greater  optical  depths  and  appears  cooler  at  others. 
This  effect  arises  because  the  hotter  gas  is  more  nearly  trans- 
parent and  actually  emits  less  per  meter  of  path  length.  For  each 
observing  frequency  there  is  a "neutral  point,"  a view  angle  or 
ray-tangent  pressure  for  which  the  brightness  is  largely  indepen- 
dent of  the  atmospheric  temperature  profile.  Thus,  this  constant 
pressure  altitude  can  conceptually  be  located  by  determining  the 
view  angle  which  exhibits  the  known  neutral-point  brightness 
temperature;  in  practice,  a more  mathematical  estimation  procedure 
would  be  used. 

Figure  6 suggests  that  the  brightness  temperature  varies 
about  0.2  K in  10  meters,  so  a radiometer  sensitivity  of  0.2  K can 
enable  the  reference  pressure  altitude  to  be  inferred  with  an 
accuracy  several  times  worse  than  this,  say  50  meters,  which  is 
more  than  adequate.  Corrections  must  also  be  made  for  refractive 
effects  and  the  effects  of  trace  constituents;  these  also  intro- 
duce errors  of  varying  significance.  A variety  of  standard  linear 
and  nonlinear  retrieval  procedures  can  be  adapted  to  this  problem. 

There  are  two  other  independent  ways  to  deteimine  atmospheric 
temperatures  from  limb-scanning  spectra.  Above  those,  altitudes 
where  pressure  broadening  dominates,  i.e.,  above  »»  60  km,  Doppler 
broadening  becomes  important.  Measurements  of  Doppler  width  can 
yield  estimates  of  the  translational  temperature  of  the  oxygen. 

The  second  technique  has  long  been  used  for  optical  spectra;  the 
relative  populations  of  a series  of  energy  levels  is  determined, 
and  these  yield,  in  the  case  of  02#  a rotational  temperature. 
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WEIGHTING  FUNCTION  FOR  12  KM  TRRGET  HEIGHT 


Fig.  5.  Limb-scan-mode  temperature  weighting  functions 
obtained  near  118.75  GHz  for  a ray  tangent  at  12-km  altitude.  The 
peaks  at  12  km  are  truncated.  These  functions  can  be  translated 
continuously  in  altitude  by  scanning  the  limb. 

These  techniques  have  not  yet  been  employed  for  atmospheric 
probing,  but  have  been  proposed  for  a limb- scanning  experiment  on 
the  space  shuttle . 

Several  avenues  for  improvement  of  microwave  retrievals  exist. 
The  present  linear  and  quasilinear  techniques  are  nearly  optimum 
for  single  position  retrievals,  but  SCAMS  and  future  temperature 
sounders  will  generally  map  the  temperature  field  in  three  dimen- 
sions. Some  appropriate  smoothing  could  be  employed  to  improve 
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temperature  retrievals  because  the  temperature  field  is  correlated 
over  ~ 1000  km  in  the  troposphere,  and  -2000  km  in  the  stratosphere. 
Similar  coorelations  exist  in  time,  and  they  also  could  be  utilized, 
as  could  independent  data  sets,  such  as  the  ciirrent  NMC  analysis, 
etc.  These  correlations  can  be  extrapolated  to  greater  distances 
in  time  and  space  by  means  of  atmospheric  prediction  models;  thus, 
enabling  each  retrieval  to  incorporate  still  larger  sets  of  inde- 
pendent observational  data. 

Furthermore,  those  heavy  precipitation  cells  that  sometimes 
introduce  errors  into  retrievals  are  generally  much  smaller  than 
the  correlation  length  of  the  temperature  field,  and  should,  there- 
fore, be  detectable  and  partially  correctable. 

One  formulation  for  such  problems  is  that  of  Kalman-Bucy 
filters  (Ref.  20) . In  this  construct,  the  process  to  be  estimated 
is  assumed  to  be  drawn  from  the  class  of  first-order  continuous 
time  infinite-state  Markov  processes.  Its  action  upon  input  data 
is  to  optimally  estimate  the  truly  random  elements  controlling 
transitions  between  states.  These  random  elements  provide  infor- 
mation which  is  then  used  along  with  deterministic  elements,  to 
update  the  filter's  estimate  of  the  current  state  of  the  process, 
i.e.,  the  observables.  This  model  permits  estimation  over  a finite 
obseirvation  window  and  estimation  of  nonstationary  processes, 
neither  of  which  can  be  treated  by  conventional  Wiener  theory. 
Furthermore,  nonlinear  relationships  between  the  random  elements 
and  the  estimated  observables  can  be  introduced  in  a natural  man- 
ner. This  and  the  Markov  process  assumption  produce  a very  con- 
venient approach  for  retrieval  of  temperatures  or  any  other 
meteorological  parameters.  In  fact,  such  a retrieval  process 
could  even  be  combined  with  the  numerical  weather  prediction 
process,  with  the  present  numerical  prediction  models  being  incor- 
porated as  part  of  the  processor  that  estimates  the  observables. 
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III.  SOUNDING  OP  ATMOSPHERIC  COMPOSITION 


Figure  1 illustrated  resonances  of  oxygen  and  water  vapor. 
Other  important  molecules  include  O3,  N2O,  and  CO.  Stronger 
resonances  below  200  GHz  are  included  in  Table  1. 

TABLE  1 

Strongest  Microwave  Resonances  below 
200  GHz  of  Trace  Constituents 


Species 

Re  s onanc  e (GHz) 

H20 

22.235 

183.310 

O3 

101.737 

110.836 

125.087 

142.175 

165.784 

184.378 

195.430 

14  16 

N„  0 

100.492 

125.614 

150.735 

175.856 

115.271 

The  spectra  of  these  and  other  molecules  was  recently  reviewed 
by  Waters  (Ref.  3)  . 

Such  spectral  lines  can  be  observed  passively  against  three 
different  backgrounds:  (1)  cold  space  reflected  by  the  ocean; 

(2)  an  opaque  atmosphere  at  a different  temperature  that  provides 
contrast;  and  (3)  cold  space,  as  in  the  limb- scanning  mode. 

Passive  microwave  satellite  measurements  of  atmospheric  com- 
position have  been  made  only  of  water  vapor  and  liquid  water  over 
ocean;  the  high  reflectivity  of  ocean  (~  60  percent)  near  the 
1.35-cm  resonance  provides  a cold  background  for  thermal  emission 
from  both  water  vapor  and  liquid  water.  Figure  7 illustrates  the 
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Figr.  7.  Computed  brightness  temperature  of  smooth  ocean  at 
288  K overlaid  by  (1)  no  atmosphere;  (2)  a nominal  atmosphere, 
including  a modest  amount  of  water  vapor;  (3)  the  same  plus  a 
heavy  stratus  cloud;  and  (4)  a more  humid  but  cloud-free  atmos- 
phere . 


brightness  temperatures  that  would  be  obseirved  at  nadir  over  a 
smooth  ocean  surface. 

The  spectrum  in  the  absence  of  any  atmosphere  would  not  be 
flat  (see  Fig.  7)  because  the  dielectric  constant  of  sea  water 
depends  on  frequency  as  well  as  on  temperature r and  salinity. 

This  temperature  dependence  has  the  curious  property  that  the  sea 
brightness  temperature  is  nearly  independent  of  sea  temperature  at 
wavelengths  near  1 cm;  it  increases  and  decreases  slightly  with 
temperature  for  longer  and  shorter  wavelengths,  respectively. 

This  fact  largely  removes  sea  temperature  as  a major  source  of 
error  in  atmospheric  water  retrievals;  climatic  values  for  sea 
temperature  are  generally  sufficiently  accurate. 
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Atmospheric  water  vapor  appears  in  emission  against  the  cold 
ocean  background  with  a line  shape  and  amplitude  that  is  nearly 
proportional  to  the  absorption  coefficient  of  the  atmosphere. 

This  almost  linear  relationship  results  from  the  Rayleigh-Jeans 
approximation  of  Eq.  (1)  and  because  the  atmosphere  is  nearly 
transparent,  as  seen  in  Fig.  1,.  even  at  the  center  of  the  resonance 
and  over  the  tropics.  The  H^O  line  width  for  each  atmospheric 
layer  is  proportional  to  its  pressure,  and,  thus,  the  total  line 
shape  contains  information  about  the  water  vapor  profile  which  can 
be  retrieved.  Linear  and  quasi-linear  techniques  have  been  used 
successfully  for  estimating  water  vapor  abundances  and  profiles; 
the  problem  is  sufficiently  linear  that  they  generally  suffice 
(Refs.  21,  22,  and  23). 


Water  vapor  weighting  functions  W(v,£)  may  be  defined  as 


B^(v)  T ^(v)  + 
B const 


P„  ^{^)W(v,i2.)di2. 
®2^ 


(5) 


_ 1 -|_2 

where  W(v,il)  (K  g m ) is  independent  of  the  water  vapor  profile 

— 3 

PH^0(£)  (g  m ) for  small  perturbations  of  p„  about  a standard 

atmosphere.  Because  water  vapor  weighting  functions  near  1-cm 

wavelength  have  widths  with  a half -amplitude  of  - 2 pressure  scale 

heights  or  =16  km,  water  vapor  retrievals  can  extract  little  more 

than  the  total  tropospheric  abundance  and  perhaps  a water  vapor 

scale  height  (p  usually  is  exponential  with  a scale  height 

H2O 

near  2 . 2 km) . 


The  presence  of  clouds  and  precipitation  complicates  water 
vapor  retrievals  only  slightly  because  the  nonresonant  character 
of  cloud  absorption  can  be  distinguished  from  the  resonant  charac- 
ter of  water  vapor  (see  Figs.  1 and  7) , as  noted  by  Staelin  (Ref. 

21)  and  others  (Refs.  22  and  23).  Rosenkranz,  et  al.  (Ref.  22)  con- 
cluded theoretically  that  rms  accuracies  of  1.3  mm  precipi table 
water  vapor  and  0.06  ram  liquid  water  could  be  obtained  from  obser- 
vations over  the  ocean  at  only  two  frequencies,  such  as  22.2  and 
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31.4  GHz.  Comparisons  of  such  two-frequency  water  vapor  estimates 
with  radiosondes  yield  rms  errors  of  ^1.2  and  4.4  mm  (Ref-  23); 
the  poor  agreement  is  very  likely  diie  to 

1.  the  point-sampling  character  of  radiosondes  as  contrasted 
with  the  areas  of  ^10^  km^  averaged  by  NEMS  or  SCAMS,  and 

2.  the  inability  of  radiosondes  to  respond  well  to  low 
humidity. 

Theoretical  errors  can  be  reduced  through  use  of  more  than  two 
sounding  frequencies. 

Heavy  precipitation  can  be  partially  opaque  (>  4 dB  attenu- 
ation) near  1-cm  wavelength  for  precipitation  rates  above  ~3  mm 
hr  Because  such  precipitation  is  obten  confined  to  convective 
regions  with  an  areal  extent  much  smaller  than  the  ~10^  km^  sampled 
by  NEMS  or  SCAMS,  the  water  vapor  retrievals  are  not  significantly 
affected,  as  suggested  in  Fig.  8 which  presents  water  vapor  and 
liquid  water  retrievals  from  NEMS  over  an'  Indian  Ocean  typhoon 
(Ref.  23) . If  high  angular  resolution  sounders  were  used,  then 
the  water  vapor  retrievals  over  the  opaque  storm  cells  would 
naturally  be  less  accurate. 

The  sounding  of  water  vapor  profiles  near  the  core  of  the 
opaque  183.3  GHz  resonance  requires  a different  retrieval  pro- 
cedure, similar  to  that  used  for  interpreting  satellite  observa- 
tions of  the  opaque  infrared  bands  of  water  vapor.  If  the  tempera- 
ture profile  is  known,  then  each  value  of  brightness  temperature 
on  the  observed  resonance  corresponds  to  the  altitude  which  is 
being  reached  at  that  frequency;  this  altitude  level  is  usually 
near  an  optical  depth  of  unity.  We  now  define  the  contribution 
function,  C(v,il)  , which  describes  the  degree  to  which  the  observed 
brightness  temperature  is  determined  by  the  temperature  at  various 
altitudes;  it  is  naturally  peaked. 

Tg(v)  = I T(Jl)C(v,Jl)dJl  (6) 

0 

C(v,Jl)  goes  to  zero  adx)ve  the  atmosphere  and  at  large  optical 
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Fig.  5.  Typhoon  22  February,  1973.  Images  were  made  near 
19  GHz  by  Electrically  Scanned  Microwave  Radiometer  (ESMR)  (dark 
is  warm  and  huxnid)  , Temperature  Humidity  Infrared  Radiometer  (THIR) 
6.7  Pm  (dark  is  dry)  , and  THIR  11.5  ym  (dark  is  warm  or  cloud- 
free)  . NEMS  retrievals  along  the  center  of  these  images  are  pre- 
sented for  liquid  water,  water  vapor,  and  54.9  GHz  brightness 
temperatures.  Note  the  lack  of  interference  between  the  water 
vapor  and  liquid  water  retrievals. 
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depths.  The  width  of  this  function  typically  would  approximate 
the  water  vapor  scale  height  in  the  altitude  zone  being  sounded. 
Because  the  contribution  function  depends  strongly  on  the  water 
vapor  profile/  it  is  quite  different  from  the  usual  weighting  func- 
tion, a concept  which  xs  inappropriate  here.  The  peak  of  the  con- 
tribution function  for  the  183  GHz  H^O  resonance  typically  would 
fall  between  0 and  ==  8 km  for  frequencies  between  the  wing  and  the 
line  center,  respectively.  Still  more  opaque  resonances  at 
shorter  wavelengths  enable  water  vapor  profiles  to  be  sensed  to 
still  higher  altitudes.  Little  can  be  learned  in  regions  with  zero 
lapse  rate,  however,  except  perhaps  the  total  water  vapor  content 
of  that  layer.  Most  other  resonances,  such  as  those  of  N^O,  CO, 
and  some  of  are  generally  not  opaque  and  can  be  observed  over 
ocean  or  against  the  background  of  an  opaque  troposphere;  the  con- 
trast depends  upon  the  temperature  difference  between  the  atmos- 
phere at  the  peak  of  the  resonant  contribution  function  and  that  of 
the  background.  Although  no  such  microwave  measurements  have  yet 
been  made  from  space,  they  are  quite  feasible. 

The  most  sensitive  and  precise  way  to  measure  atmospheric  com- 
position is  by  means  of  limb  scanning.  The  path  length  of  the 
radiation  is  - 300  km  near  the  tangent  point  (where  the  ray  is 
parallel  to  the  terrestrial  surface  or  geoid) , as  contrasted  with 
~ 10  km  for  nadir  obse2rvations.  This  factor  of  -30  is  augmented 
another  factor  of  - 10  by  the  greater  contrast  provided  by  cold 
space,  as  opposed  to  that  supplied  by  the  underlying  atmosphere 
for  nadir  observations.  Waters  (Ref.  3)  has  analyzed  this  pos- 
sibility for  several  trace  constituents  and  it  appears  that  CO 
might  be  sensed  up  to  -100  km  or  more  using  the  230  GHz  resonance, 
H^O  to  ^100  km  using  183  GHz,  to  -100  km  at  184  GHz,  etc.  Such 
constituents  might  be  sensed  down  to  altitudes  of  - 8 km,  depending 
on  the  altitudes  of  water  clouds.  The  abundances  can  be  measured 
with  accuracies  of  a few  percent  with  2 km  vertical  resolution 
for  constituents  that  are  sufficiently  abundant.  Exploitation  of 
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these  techniques  will  require  somewhat  more  precise  antennas  than 
those  used  heretofore,  but  they  are  within  the  present  state  of 
the  art  and  compatible  with  the  constraints  imposed  by  the  space 
shuttle  program. 

IV.  SOUNDING  OF  ATMOSPHERIC  PRESSURE 

There  are  at  least  two  approaches  to  the  sounding  of  atmos- 
pheric pressure  using  microwave  techniques  and  a single  satellite. 
These  include  measurement  of 

1.  the  pressure-dependent  differential  atmospheric  cibsorption 
near  a strong  resonance  of  a constituent  like  O^,  which  has  a 
nearly  constant  mixing  ratio;  this  can  be  done  in  principle  using 
radar  techniques  from  a satellite  viewing  nadir  over  ocean;  and 

2.  brightness  temperatures  at  the  limb  at  one  or  more  fre- 
quencies, as  a function  of  altitude  of  the  ray  tangent  point,  which 
must  be  known  to  a precision  greater  than  that  desired  for  the 
measurements  of  pressure  altitude. 

The  successful  operation  of  any  of  these  techniques  would  require 
a level  of  precision  in  one  or  more  particulars  beyond  the  present 
state  of  the  art;  several  of  these  necessary  advances  appear  to  be 
feasible. 

Pressure  measurements  have  been  made  on  other  planets  for 
decades  by  observing  the  widths  of  spectral  lines  formed  by 
absorption  of  sunlight  reflected  from  the  planetary  surface  or 
from  cloud  layers.  Radar  measurements  of  Venus  near  3-  and  10-cm 
wavelength  were  used  in  a similar  manner  by  Barrett  and  S tael in 
(Ref.  24)  to  support  the  contention  that  the  surface  pressure  on 
Venus  was  very  high;  CO^  was  the  presumed  microwave  absorber.  The 
same  technique  can  be  employed  from  Earth-orbiting  satellites,  for 
example,  using  radar  at  two  frequencies  -0.5-1  GHz  apart,  centered 
near  52-52.5  GHz.  Such  radar  echoes  might  suffer  attenuations  dif- 
fering by  5-10  dB  due  to  the  nearby  absorption  band;  the 
absorption  is  a function  of  the  mass,  temperature,  and  pressure  of 
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the  atmosphere f particularly  in  that  portion  below  3 km  where  most 
absorption  would  occur.  Since  such  absorption  is  approximately 
proportional  to  the  square  of  pressure,  differential  attenuation 
should  be  measured  with  an  accuracy  of  ~ (10  dB/1000)  x 2 = 0.02  dB 
if  errors  of  ~1  mb  are  to  be  achievable.  This  is  an  unusually 
stringent  requirement  and  would  require  very  precise  calibration. 
The  measurement  and  interpretation  problem  is  compounded  by  the 
facts  that  (1)  the  radar  echo  strength  varies  randomly  at  the  two 
operating  frequencies,  (2)  sea  state  reflectivity  and  absorption  by 
clouds  and  water  vapor  are  slightly  different  at  the  two  frequen- 
cies, and  (3)  the  abso2rption  is  somewhat  temperature  dependent 
(fortunately,  the  temperature  dependence  is  near  a minimum  at  52. 
GHz) . These  difficulties  notwithstanding,  pressure  accuracies  of 
a few  mb  appear  to  be  achievable  in  principle. 

The  use  of  limb-scanning  pressure  sensors  has  the  advantage 
that  such  systems  could  map  the  entire  Earth  rather  than  only  nadir 
points  over  ocean-  The  technique  involves  use  of  the  neutral- 
point  concept  described  earlier,  wherein  a particular  brightness 
temperature  at  a particular  frequency  tends  to  occur  for  a ray 
tangent  height  at  a particular  known  pressure  altitude.  Because 
this  neutral-point  is  slightly  dependent  on  the  temperature  pro- 
file, and  si-nce  the  ray  is  refracted  so  as  to  displace  the  apparent 
tangent  height  by  perhaps  several  hundred  meters,  a slightly  non- 
linear inversion  procedure  would  appear  to  be  appropriate.  Although 
no  inversion  experiments  have  been  performed,  it  appears  likely 
that  pressure  altitudes  could  be  determined  with  an  accuracy  of 
-25-100  m by  use  of  absorption  above  "^10  km  altitude.  This 
altitude  must,  of  course,  be  referenced  to  the  Earth,  which  would 
require  the  satellite  position,  altitude,  and  angular  orientation 
of  the  instrument  to  be  known  with  accuracies  on  the  order  of 
kilometers,  meters,  and  1 arc  sec,  respectively;  this  appears  to  be 
feasible  but  difficult. 
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similar  procedures  could  be  employed  in  the  infrared  region, 
but  even  the  slightest  trace  of  haze  or  clouds  could  become  a 
serious  source  of  error  here.  An  advantage  of  infrared  is  that  the 
antenna  would  be  smaller  and  easier  to  point  precisely. 

Neither  of  these  two  pressure  measuring  techniques  provides 
a complete  pressure  profile;  the  radar  measures  an  average  pressure 
over  the  lowest  few  km  of  the  atmosphere,  and  the  passive  limb- 
sounding technique  can  determine  pressure  altitudes  ~ 10-60  km 
(i.e.,  that  range  over  which  pressure  broadening  dominates  the 
line  widths,  and  water  vapor  and  refractive  effects  are  small) . 
Either  technique  must  be  supplemented  by  temperature  profile 
measurements  to  yield  a complete  pressure  profile,  and  the  accuracy 
of  these  derived  profiles  degrades  for  altitudes  far  from  the  pres- 
sure reference.  Therefore,  a good  combined  system,  if  feasible, 
might  use  (1)  limb-scan  measurements  to  map  globally  the  pressures 
near  the  tropopause,  (2)  nadir  radar  measurements  to  provide  more 
accurate  but  more  sparse  surface  pressures  over  ocean,  and 
(3)  microwave  or  infrared  temperature  profile  spectrometers  for 
enabling  the  pressure  measurements  to  be  extrapolated  to  all 
altitudes . 


SYMBOLS 


B receiver  bandwidth,  Hz 

C(v,il)  water  vapor  contribution  function 

d diameter  of  droplets 

h Planck's  constant 

-2  -1  -1 

I intensity,  W m Hz  ster 

k Boltzmann  * s constant 

Zfl'  distance  along  ray  path 

n refractive  index  of  droplets 

P power  density,  W Hz  ^ 

AP/AT  ratio  of  power  change  to  temperature  change 

R surface  reflectivity 


385 


T 


const 


"^sky 

"^surf 

At 

rms 

W(v.il) 

W (v,Z) 

\ 


V 

PH20 

T 


T 

O 


temperature,  K 
antenna  temperaturer  K 

an  appropriate  constant  temperature,  K 
receiver  noise  temperature,  K 

sky  brightness  temperature  above  atmosphere,  K 
surface  temperature,  K 
receiver  sensitivity,  K 

temperature  or  water  vapor  weighting  function 
first  order  temperature  weighting  function 
wavelength,  m 
frequency,  Hz 

—3 

water  vapor  density,  g m 
receiver  integration  time,  sec 
atmospheric  opacity,  nepers 
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DISCUSSION 


Twomey:  We  seem  to  have  uncovered  another  duality  here  apart  from 

the  one  Henry  Fleming  spoke  about . When  the  infrared  people  com- 
pare their  retrievals  with  the  radiosondes  they  say  we  don’t  seem 
to  have  done  quite  as  well  as  we  hoped  for  with  this.  When  the 
microwave  people  compai^e  their  results  with  the  radiosondes  they 
say  the  radiosondes  don’t  seem  to  be  doing  very  well. 

Westwater:  You  mentioned,  I believe,  that  your  H2O  retrievals  dif- 

fered from  the  theoretical  results  by  about  a factor  of  three  and 
I assume  that  is  in  standard  deviation*  Did  the  deviations  show 
any  systematic  component?  Were  they  greater  for  the  larger  water 
content  or  were  they  more  or  less  uniform  throughout  the  entire  dis- 
tribution of  water  vapor  in  the  atmosphere? 

Staelln:  Essentially  uniform.  That  is  an  interesting  question. 

One  would  expect  nonuniformity  in  the  errors  because  we  were  using 
a linear  estimator  in  this  initial  procedure,  even  though  the  prob- 
lem is  nonlinear  Here  I think  a partial  explanation  lies  in  the 
statistics  of  the  atmosphere.  I think  the  water  vapor  scale 
heights  may  be  systematically  larger  where  we  have  larger  amounts 
of  water  vapor,  and  this  alters  our  retrievals  because  our  weighting 
function  is  nonuniform  with  altitude.  These  two  effects  may  tend 
to  cancel;  the  underestimate  due  to  high  H2O  opacity  balances  the 
overestimate  that  results  for  large-scale  heights. 

Westwater:  And  the  second  question,  was  there  any  correlation 

between  the  error  and  the  amount  of  liquid  water  content  that  was 
also  inferred?  In  other  words,  were  the  clouds  affecting  the 
retrievals  somewhat  more  than  what  your  theory  would  predict? 

Staelin:  Ground  truth  data  is  poor  near  intense  storms.  However, 

satellite  passes  over  the  ITCZ  have  yielded  water  vapor  retrievals 
that  vary  smoothly  and  continuously  across  regions  where  the  liquid 
water  retrievals  vary  abruptly  from  zero  to  maximum  and  back.  Such 
insensitivity  to  liquid  water  suggests  this  is  not  a problem. at 
this  spatial  resolution  of  the  order  of  200  km. 

Plant:  You  pointed  out  the  nicety  of  microwave  soundings  in  a 

cloudy  atmosphere.  Could  you  make  some  comments  on  a comparison 
between  a microwave -only  temperature  sounding  and  an  infrared-only 
temperature  sounding  as  you  go  from  a cloudy  atmosphere  to  par- 
tially cloudy  atmosphere  to  clear  atmosphere? 

Staelin:  That’s  a loaded  question.  I think  it  depends  on  the 

details  of  the  systems.  In  a clear  atmosphere,  the  principal  dif- 
ference between  infrared  and  microwaves  is  that  the  temperature 
weighting  functions  in  the  microwave  region  are  decidedly  narrower 
above  the  tropopause.  Below  the  tropopause,  where  the  temperature 
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gradient  is  significant,  the  infrared  channels  yield  comparable 
widths  and  the  four-micron  functions,  in  particular,  become  nar- 
rower as  they  approach  the  surface.  In  partial  cloudiness,  it 
depends  on  the  infrared  scientist's  agility  in  retrieving  through 
partially  cloudy  conditions . Infrared  cannot  penetrate  unbroken 
cirrus,  whereas  cirrus  is  totally  transparent  at  the  microwave 
frequencies  we  are  using.  In  heavy  precipitating  clouds,  the 
microwave  channels  sounding  below  "^10  km  will  be  affected,  and  it 
again  depends  on  one's  agility  in  handling  that  problem.  The  per- 
centage of  the  time  that  these  sensor  types  have  problems  for 
various  weather  conditions  has  been  analyzed  to  some  extent  by 
W.  Smith  and,  perhaps,  others.  I don't  recall  the  quantitiative 
results,  but  there  clearly  is  an  advantage  to  the  microwave  system. 

J.  Russell:  In  the  weighting  functions  you  showed  for  the  limb 

mode,  I think  you  stated  it  was  2 kilometers  width  at  the  half- 
height point.  Was  that  for  an  infinitesimal  becim  width? 

Staelin:  Yes.  And  half-height  in  this  case  is  not  really  the 

right  term  because  the  peak  amplitudes  of  those  weighting  functions 
are  much  greater  than  shown  in  the  figure  which  was  clipped  at 
the  right  margin. 

J.  Russell:  How  much  does  the  width  change  with  a finite  beam 

width? 

Staelin:  One  would  convolve  the  two  functions.  To  first  order,  I 

would  take  the  geometric  sum  of  the  two  widths.  If  we  have  a 2-* 
kilometer  beam  and  a 2- kilometer  weighting  function,  that  would  be 
roughly  3 kilometers. 

Chahine:  How  small  can  the  field  of  view  be  in  your  12-channel 

sounder , reasonably? 

Staelin:  The  12-channel  sounder  that  presently  exists  has  a 7.5^ 

field  of  view,  which  is  the  same  as  present  microwave  sounder  ilhit 
on  the  Tiros-N  Satellite. 

Chahine:  Which  is  how  many  kilometers  at  the  surface? 

Staelin:  I forget  the  altitude  of  Tiros-N.  I think  the  footprint 

is  on  the  order  of  150  kilometers. 

Kaplan:  How  much  worse  off  are  you  with  clouds  if  you  use  the  2 

millimeter  oxygen  band? 

Staelin:  We  have  been  studying  that  question.  The  2^millimeter 

oxygen  band  has  an  advantage  that  a smaller  antenna  will  give  you 
comparable  resolution.  Roughly  speaking,  the  118-GHz  opacity  of 
a given  cloud  or  cloud  layer  is  roughly  a little  more  than  twice 
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what  it  would  be  at  60  GHz..  But  because  of  the  statistics  of  cloud 
opacity  as  a function  of  position  on  the  Earth,  it  appears  we  will 
not  be  affected  much  more  than  we  are  at  60  GHz.  To  evaluate  that 
exactly  will  be  difficult  because  there  are  no  good  statistics  that 
are  relevant. 

Unidentified  Speaker:  Do  you  see  any  advantage  in  going  to  shorter 

wavelengths,  higher  frequencies,  say,  millimeter  and  sub-millimeter 
waves  in  future  systems? 

Staelin:  I think  it  is  in  the  study  of  trace  constituents  that  we 

will  obtain  the  greatest  advantage,  because  their  line  strengths 
typically  increase  at  shorter  wavelengths.  There  is  another 
advantage,  particularly  for  the  initial  synchronous  satellite 
systems  in  that  smaller  antennas  will  yield  better  resolution.  I 
think  that  not  too  far  into  the  future  we  will  have  antennas  of 
almost  any  arbitrary  size  available.  So  that  advantage  will  dis- 
appear after  some  decades.  Although  the  insensitivity  to  clouds 
perseveres  into  the  short  millimeter  wavelengths,  it  does  degrade 
and  at  hundreds  of  gigahertz  even  ice  clouds  become  much  more 
significant. 

Wark:  There's  one  thing  that  seems  to  be  swept  under  the  rug  a 

lot  of  times  and  that  is  all  the  inverse  problems  performed  near 
the  surface.  There  is  clearly  a difference  between  the  sort  of 
thing  we  saw  Henry  Fleming  present  and  what  you  have  presented  in 
the  1000  to  800  millibar  range.  This  is  extremely  important  from 
a meteorological  standpoint  because  the  thing  becomes  very  bad 
there.  The  rest  of  the  sounding  is  correspondingly  bad.  Could 
you  comment  on  how  microwaves  affect  this  result  near  the  surface 
versus  your  method  of  retrieval? 

Staelin:  If  the  atmosphere  is  clear  and  the  surface  is  known, 

then  performance  depends  on  the  shape  of  the  kernel.  As  I mentioned, 
the  kernels  can  be  narrower  in  the  4 ym  band,  and  even  somewhat 
narrower  in  the  15  ym  band,  depending  on  the  temperature  profile. 

In  the  presence  of  clouds,  infrared  retrievals  will  degrade.  In 
terms  of  the  retrieval  technique,  we  have  used  primarily  linear 
regressions.  The  temperature  retrieval  errors  increase  near  the 
surface  because  we  are  comparing  our  3 -channel  retrievals  to 
rather  detailed  profiles  obtained  by  radiosondes  or  from  the  NMC 
analyses,  both  of  which  exhibit  inversions  and  other  sharp  dis- 
continuities near  the  sur'face,  which  no  broad  kernel  could  recover. 

Wark:  But  your  results  of  the  thousand  millibars  were  considerably 

worse  than  anything  we  have  seen  here. 

Staelin:  That  is  largely  because  we  did  not  put  in  the  surface 

temperature.  Had  we  used  the  NMC  forecast  surface  temperature 
or  an  infrared  surface  temperature  estimate,  our  results  would 
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have  been  better. 


Wark:  All  right,  then  it  is  the  method;  not  the  consequence  of 

the  employment  of  microwaves.  It  is  the  method  of  retrieval  where 
you  do  not  employ  a piece  of  information? 

Staelin:  That's  right-  We  definitely  perform  better  if  we  know 

the  surface  temperature  either  from  an  infrared  surface  tempera- 
ture measurement  or  from  a priori  information,  etc. 

Sussklnd:  I will  mcike  a comment  about  some  of  the  previous  things. 

We  have  been  analyzing  some  of  the  information  contained  in  the 
HIRS  IR  data  and  SCAMS  data,  both  on  Nimbus  6,  and  with  regard 
to  some  of  the  questions  that  have  been  answered.  The  quality  of 
the  sounding  between  SCAMS  alone  and  HIRS  alone  is  very  comparable 
one  to  another.  In  the  presence  of  clouds,  if  it  is  completely 
overcast  you  are  out  of  the  game  in  the  infrared.  But,  say,  when 
it  is  80-85  percent  cloudy,  the  quality  of  the  soundings  is  com- 
parable. And,  likewise,  with  surface  temperatures.  The  quality  of 
the  soundings  in  the  lower  troposphere  is,  in  fact,  comparable. 

We  got  results  of,  say,  3 RMS  error  at  the  surface  in  the  micro- 
wave.  The  errors  are  largest  at  the  surface  and  at  850  mb  and  are 
a little  larger  in  the  microwave  than  in  the  infrared,  but  we 
interpret  this  to  be  due  to  some  problems  in  calculating  the  sur- 
face emissivity  exactly.  But  differences  are  very  small — ^maybe 
1/10  or  2/10  degrees  in  the  RMS  sense  between  the  microwave  and 
infrared-  And  we  have  come  to  the  conclusion  that  the  sounders  are 
quite  comparable  one  to  the  other — slightly  worse  errors  down  low, 
but  nothing  drastic.  I might  add  that  our  results  are  obtained  with 
no  additional  surface  information  but  the  observations  and  a 
climatology  guess. 

Staelin:  I think  one  has  to  be  careful  in  distinguishing  between 

comparisons  of  particular  instruments  and  comparisons  in  principle. 
In  other  words,  these  particular  satellites  have  both  infrared  and 
microwave  channels  which  might  be  chosen  differently.  Certainly 
with  a larger  budget  one  would  use  more  than  three  microwave 
temperature- sounding  channels.  Then  the  performance  limits  are 
ultimately  set  by  the  widths  of  the  kernels,  and  the  effects  of 
surfaces  and  clouds.  Surface  emissivity  effects  are  observable 
only  for  weighting  functions  that  peak  within  - 4 km  of  the  sur- 
face. Extreme  variations  for  the  4 km  channel  of  NEMS  and  the 
2-km  channel  of  SCAMS  were  approximately  1.5  K and  10  K,  respec- 
tively, in  brightness  temperature  as  the  satellite  passed  from 
ocean  to  land;  this  can  largely  be  compensated  by  use  of  window 
channels. 

Fraser:  Meteorologists,  of  course,  would  like  to  have  the  pres- 

sure measured  to  10/10  of  a percent  of  the  surface  pressure.  What 
development  efforts  are  underway  to  do  that  with  microwave? 
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Staelin:  No  major  efforts  to  my  knowledge. 

Fraser:  Because  it  is  so  important  and  you  indicated  that  it 

might  be  feasible.  Is  there  some  reason  for  this? 

Staelin:  I should  say  none  that  are  funded.  There  has  been 

effort  and  interest  ih  this  in  a number  of  quarters.  Somje  of  these 
techniques  have  been  used  in  astronomy.  Radar  pressure  sounding 
data,  for  example,  was  used  on  Venus  over  ten  years  ago  by  Barrett 
and  myself  to. support  the  contention  the  atmosphere  was  dense. 

Limb  sounding  measurements,  of  course,  have  also  been  used  in 
astronomy.  The  task  of  applying  these  techniques  to  actual 
working  instruments  involves  profound  engineering  problems. 

Desmond  Smith  in  Scotland  has  been  interested  in  this  problem  for 
many  years.  I don't  know  the  present  status  of  that  work.  I had 
one  student  write  his  thesis  on  this  subject.  But  there  has  been 
no  significant  funding.  Proposals  have  been  submitted,  but  other 
things  have  priority. 

Chahine:  One  of  your  former  students.  Dr.  Joe  Waters,  is  working 

on  the  development  of  pressure  sounder  at  JPL. 

Staelin:  I think  it  is  an  interesting  problem  and  obviously  of 

great  importance  if  one  can  be  successful.  More  effort  seems  to  be 
warranted. 
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APPLICATION  OF  STATISTICAL  INVERSION  TO 


GROUND-BASED  MICROWAVE  REMOTE  SENSING 
OF  TEMPERATURE  AND  WATER 
VAPOR  PROFILES 


E.  R.  Westwater  and  M.  T.  Decker 
NOAA/ERL/Wave  Propagation  Laboratory 


Surface-based  observations  of  downwelling  microwave  ther- 
mal emission  are  related  to  temperature  and  humidity  profiles 
via  a standard  integral  equation  of  radiative  transfer.  Both 
in  clear  and  in  cloudy  atmospheres , statistical  inversion^ 
techniques  can  be  used  to  retrieve  profiles  from  a data  vector 
of  brightness  observations  and  surface  meteorological  con- 
straints. For  the  clear  case,  illustrate  accuracy  pre- 
dictions and  profile  retrievals  for  (a)  single- frequency 
angular- scanned  data,  (b)  multi- frequency  angular  scanned  data, 
and  (c)  multi- frequency  zenith  data.  For  case  (c)  we  compare 
predicted  and  achieved  accuracies  in  a recently  conducted 
joint  NOAA-JPL  radiometric  experiment.  Finally,  we  present 
retrievals  of  cloud-contaminated  radiometric  data. 


I . INTRODUCTION 

The  continuous  measurement  of  temperature  and  h\imidity  in  the 
Earth's  boundciry  layer  is  an  important  requirement  in  some  areas 
of  meteorological  research.  Ground-based  microwave  radiometric 
measurements  of  temperature  structure  show  promise  of  meeting  this 
need  and  the  technique  has  been  investigated  by  several  groups 
(Refs.  1,  2g  3,  and  4) . Limited  information  on  the  moisture  pro- 
file is  also  radiometrically  available  (Refs.  5 and  6) . 

In  this  review,  we  outline  the  application  of  statistical 
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inversion  methods  to  a few  of  the  increasingly  complex  measuring 
techniques  that  have  evolved  over  the  years.  For  each  method , 
statistical  inversion  appears  capable  of  extracting  maxim-um  infor- 
mation from  the  measurements. 

The  extent  to  which  inversion  theory  applies  to  a problem 
depends  strongly  on  the  solution  of  the  direct  problem;  i.e.,  given 
the  profile,  can  we  calculate  the  measurements  to  within  the  experi- 
mental accuracy?  We  therefore  spend  some  time  discussing  the 
accuracies  of  microwave  thermal  emission  calculations. 

We  then  review  temperature  retrieval  results  from  single-  and 
multiple-frequency  angular  scanning  radiometers.  Finally,  we  con- 
clude with  recent  results  in  multi-frequency  sensing  of  temperature 
and  moisture  profiles. 

II.  MICROWAVE  ATTENUATION  AND  EMISSION  IN  CLEAR  AND  CLOUDY 

ATMOSPHERES 

Measurements  of  microwave  radiant  power  are  commonly 

expressed  as  an  equivalent  black  body  temperature,  or  brightness 

temperature,  T,  . Except  during  rain,  atmospheric  scattering  is 
b 

small  relative  to  absorption.  For  a nonscattering  atmosphere  in 
local  thermodynamic  equilibri\im,  the  downward  brightness  tempera- 
ture at  frequency  v is  given  by 

T,  (v)  = T^  T + / T a exp  (-  / a (s*)ds’)ds  (1) 

b b ^0^0^ 

where  T is  absolute  temperature  (K) , a is  absorption  coefficient 

—1  (ext)  ^ 

(km“  ) f T,  is  brightness  temperature  external  to  the  Earth's 

D 

atmosphere  (K) , is  transmission  through  the  atmosphere,  and  s 
is  path  length  from  the  receiver  to  emitting  volimie  (km) . In  the 
troposphere,  microwave  absorption  is  due  principally  to  molf^cular 
resonances  of  (60  GHz)  and  H^O  (22  GHz)  and  to  clouds  and  rain. 

In  general,  the  absorption  is  a strong  function  of  composition  and 
a weak  function  of  temperature.  Thus,  around  22  GHz,  the  emission 
varies  principally  because  of  variations  in  water  concentration, 
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whereas  at  60  GHz,  the  emission  from  the  well-mixed  constituent 

depends  mainly  on  temperature.  For  ground-based  applications,  the 
( ext} 

term  T describes  discrete  external  sources,  such  as  the  Sun 

b 

or  Moon,  and  the  continuiim  "big  bang"  contribution  of  2.9  K. 
Calculations  of  the  microwave  absorption  coefficient  are  shown  in 
Fig.  1. 


A.  Water  Vapor  Attenuation 

Water  vapor  attenuation  arises  from  the  rotational  transition 
at  22.235  GHz  and  the  nonresonant  contribution  of  submillimeter 
and  infrared  lines.  At  frequencies  below  100  GHz,  the  spectrum  is 
the  sum  of  a resonant  term  and  a contribution  from  the  higher  fre- 
cjuency  lines  that  varies  as  v^.  An  excellent  siommary  and  dis- 
cussion of  the  theoretical  experimental  basis  of  water  vapor  atten- 
uation calculations  are  given  by  Waters  (Ref.  7) . 


Fig.  i.  Microwave  atmospheric  absorption  in  clear  air, 
clouds,  and  rain,  a = absorption  coefficient,  e = attenuation 
coefficient,  (1  bar  = 100  kPa) 
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Eor  remote  sensing  of  water  and  temperature,  we  must  accu- 
rately calculate  water  vapor  emission  and  attenuation  from  profiles 
of  meteorological  variables.  Since  several  equations  to  calculate 
vapor  attenuation  exist  in  the  literature  (Ref.  7) , we  thought 
that  a comparison  of  mdre  recent  models  in  brightness  calculations 
would  be  at  least  suggestive  of  the  accuracy  to  which  the  clear  air 
direct  problem  is  solved  in  the  20-35  GHz  band.  Thus,  we  present, 
in  Table  1,  comparisons  of  calculations  of  brightness  temperature 
for  five  selected  radiosonde  profiles.  The  calculations  labeled 
BA  used  constants  derived  by  least  squares  from  the  absorption 
data  of  Becker  and  Autler  (Ref.  8) ; L labels  calculations  using 
constants  derived  by  Liebe  (Ref.  9)  using  dispersion  spectroscopy; 
finally  W labels  the  results  using  parameters  given  by  Waters 
(Ref.  7).  BA  and  L assume  the  Van  Vleck-Weisskopf  line  shape;  W 
uses  that  of  Gross-Zhevakin-Naumov.  When  compared  with  radio- 
metric  accuracies  that  approach  1 K,  the  agreement  is  not  com- 
pletely satisfactory,  especially  for  the  profiles  with  larger 
water  content.  If  one  accepts  the  more  recent  results  (L  and  W) , 
the  agreement  is  within  5%. 

B.  Oxygen  Attenuation 

Beginning  with  the  classic  theoretical  paper  by  Van  Vleck 
(Ref.  10)  on  microwave  absorption  by  molecular  a large  amount 
of  theoretical,  laboratory,  and  atmospheric  research  has  been 
devoted  to  the  understanding  of  attenuation  from  this  constituent. 

As  a consequence,  the  knowledge  of  0^  absorption  parameters  and  of 
related  pressure  broadening  theory  has  steadily  increased.  Two 
recent  advances  are  of  note.  The  first  is  the  development  of  the 
dispersion  spectroscopy  technique  by  Liebe  and  its  application  to 
the  determination  of  the  spectroscopic  parameters  of  (Ref.  11) . 
The  second  is  Rosenkranz's  (Ref.  12)  theoretical  attenuation  model 
which  accounts  for  overlapping  lines  in  the  oxygen  complex. 

As  in  the  previous  case  of  water  vapor,  we  wanted  to 
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TABLE  1 


Comparison  of  Calculated  Zenith  Brightness 
Temperatures  at  22  and  31  GHz  Resulting 
from  Various  Choices  of  Spectral 
Line  Parameters 


Pro- 

file 

Liquid 

water 

content 

BA 

22.235  GHz 
L 

W 

BA 

31.65  GHz 
L 

W 

1 

0.8 

23.4 

22.3 

21.4 

14.7 

15.0 

14.5 

2 

1.1 

28.6 

27-2 

26.1 

16.5 

16.9 

16.3 

3 

1-3 

31-1 

31.5 

30.1 

18.1 

18-5 

17.7 

4 

2.8 

68-2 

64-5 

61.5 

29.0 

30.1 

28.3 

5 

3.6 

74.9 

70.8 

67.5 

31.2 

32.4 

30.4 

determine  the  degree  to  which  contemporary  absorption  models  agreed 
with  each  other.  To  this  end,  we  performed  calculations  of  zenith 
brightness  using  three  absorption  models:  the  first  (RMC)  used 

the  Van  Vleck-Weisskopf  line  shape  with  constants  given  by  Reber, 
Mitchell  and  Carter  (Ref-  13);  the  second  (R)  used  Rosenkranz’s 
(Ref.  12)  line  shape  and  the  constants  given  in  his  paper;  the 
third  (L)  used  Liebe's  measurements  (Ref.  11),  the  Rosenkranz  line 
shape,  and  Rosenkranz 's  value  of  the  non-resonant  line  width.  The 
results  are  shown  in  Table  2. 

There  is  close  agreement  between  all  three  models  at  53.8  and 
55-5  GHz  and  R and  L agree  well  also  at  52.8  GHz.  The  difference 
between  RMC  and  the  other  two  at  the  most  transparent  channel, 

52.85  GHz,  is  almost  constant  4.2  K)  and  occurs  primarily 
because  of  the  difference  in  absorption  prediction  in  the  pressure 
range  100-500  mb. 

C.  Comparison  of  Clear  Air  Emission  Measurement  and  Calculations 

Five-channel  microwave  observations  were  taken  at  Pt.  Mugu, 
California  and  were  kindly  provided  by  B.  Gary  and  N.  Yamane  of 
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TABLE  2 


Comparison  of  Calculated  Zenith  Brightness 
Temperatures  (K)  in  the  Oxygen  Band 
Resulting  from  Various 
Absorption  Models 


Pro- 

file 

RMC 

52.85 

GHz 

R 

L 

RMC 

53.85 

-GHz 

R 

L 

RMC 

55.45 

GHz 

R 

L 

1 

194.8 

190.6 

191.3 

251.2 

250.8 

251.2 

281-5 

281.6 

281.7 

2 

191.5 

187.2 

187.7 

250.7 

250.3 

250.6 

280.8 

281.0 

281.0 

3 

193.8 

189.6 

190.0 

257.0 

256.8 

256.9 

289-7 

289.8 

289.9 

4 

190.6 

186.3 

187.0 

248.3 

248.0 

248.4 

278.7 

278.9 

278.9 

5 

189.9 

185.5 

186.0 

252.4 

252.1 

252.4 

287.4 

287.6 

287.6 

Jet  Propulsion  Laboratory  (JPL) . Their  radiometer  was  similar  to 
the  SCAMS  system  used  on  the  Nimbus  6 satellite  (Ref,  14),  During 
the  three-week  period  of  observations,  thrice-a-day  radiosondes 
obtained  standard  meteorological  soundings  of  temperature,  pres- 
sure, and  humidity.  Calculations  of  brightness  temperature  were 
made  by  using  the  constants  of  Becker -Autler  for  water  vapor,  and 
those  of  Liebe  for  The  comparison  of  measurements  and  cal- 

culations are  shown  in  Table  3.  Considering  the  difficulties  in 
making  absolute  radiometric  measurements  and  in  observing  the  same 
volTime  of  air  with  radiosonde  and  radiometer,  the  agreement  in  the 
band  is  quite  good.  Note,  however,  the  relatively  large  vari- 
ance at  the  22  and  31  GHz  channels.  Because  of  the  difficulty  in 
obtaining  reliable  direct  measurements  of  humidity  profiles  by 
radiosondes  (Refs.  15  and  16) , these  differences  may  not  be  caused 
by  incorrect  absorption  coefficients. 

D.  Attenuation  by  Clouds  and  Rain 

Attenuation  from  a distribution  of  spherical  particles  of  known 
dielectric  properties  can  be  calculated  by  classical  electromag- 
netic theory.  Depending  on  the  ratio  of  particle  size  to 
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TABLE  3 


Average  and  Root-Mean-Square  (rms)  Differences 
(K)  between  Measured  and  Calculated 
Brightness  Temperatures 
(N  = 24  Radiosonde 
Observations) 

Frequency  (GHz) 


Difference 

22.234 

31.65 

52.85 

53.85 

55,45 

Average 

2.5 

1.7 

0.2 

0.1 

-1.3 

Root-mean- square 

3.4 

1.8 

1.8 

0.8 

1.4 

Percent 

12.6 

11.2 

0.9 

0.3 

0-5 

wavelengthr  simplicity  or  complexity  prevails.  In  the  domain 
here  this  ratio  is  small,  called  the  Rayleigh  region,  attenuation 
is  independent  of  size  distribution  and  is  directly  proportional 
to  total  mass  of  droplets.  In  addition,  scattering  is  negligible 
relative  to  absorption.  For  large  particles,  Mie  theory  must  be 
used,  and  attenuation  depends  on  size  distribution  in  both  absorp- 
tion and  scattering.  For  our  purposes,,  we  will  consider  water 
clouds  with  modal  radii  less  than  50  ym  to  be  in  the  Rayleigh 
region  for  frequencies  less  than  100  GHz.  Calculations  of  water 
absorption  for  a cloud  liquid  water  content  of  =0.1  g/m^ 

are  shown  in  Fig.  1.  Depending  on  the  frequency,  spherical  ice 
particles  absorb  from  one  to  two  orders  of  magnitude  less  than  an 
equivalent  amount  of  water. 

In  contrast  to  nonprecipitating  clouds,  rain  (and  hail)  both 
scatters  and  absorbs  microwave  energy.  The  attenuation  coefficient 
e must  be  calculated  from  Mie  theory  for  both  absorption  and  scat- 
tering coefficients.  Calculations  of  e for  a moderate  rain  of 
12.5  mm/hr  are  also  shown  in  Fig.  1.  We  assumed  a Laws  and 
Parsons  (Ref.  17)  size  distribution  for  this  rain  rate  (liquid) 
water  content  = 0.6  g/m^) . It  is  clear  that  rain  dominates  all 
other  sources  of  attenuation  except  in  the  vicinity  of  the  oxygen 
complex . 
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The  atmospheric  thermal  emission  spectrum  in  the  presence  of 
a cloud  can  differ  considerably  from  that  obtained  under  clear 
conditions.  In  detail,  the  amount  of  contrast  depends  on  the 
height  profiles  of  temperature,  humidity,  and  liquid  content. 
However,  the  largest  pontribution  to  the  difference  is  the  liquid 
thiclcness  with  the  emission  being  relatively  insensitive  to  the 
height  and  geometric  thickness  of  the  cloud  (Ref.  18) . Table  4 
shows  estimated  rms  differences  between  clear  and  cloudy  bright- 
ness temperatures  for  the  July  climatology  of  Pt.  Mugu,  California. 
These  differences  clearly  show  the  need  for  cloud  correction. 
Similar  calculations  for  ocean  climatologies  indicated  rms  dif- 
ferences about  twice  as  large  as  those  of  Table  4. 

III.  STATISTICAL  INVERSION  OF  GROUND-BASED  MICROWAVE  RADIOMETER 
DATA  TO  RECOVER  VERTICAL  TEMPERATURE  PROFILES 

Minimimi-variance  statistical  inversion  is  used  to  estimate  a 
parameter  vector  p from  a data  vector  d according  to  the  well- 
known  prescription 

^ = <p>  + <p*  <d’  (2) 

In  Eq.  (2),  p is  the  estimator  of  p,  <•>  refers  to  ensemble 
averages  over  joint  distributions  of  p and  d,  and  primes  denote 


TABLE  4 


Rms  Differences  between  Clear  and  Cloudy 
Zenith  Brightness  Temperatures 


Frequency, 

GHz 

Rms  difference, 
K 

22.235 

3.6 

31.65 

7.6 

52.85 

6.4 

53.85 

2.0 

55.45 

0.1 
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departures  from  mean  values.  The  assumptions  and  derivations 
leading  to  this  equation  are  given  by  Rodgers  (Ref.  19) - The 
covariance  matrix  of  this  estimate,  S is  given  by 

S ^ 5 < (p  - fl)  (p  - ^)'^>  = <p'  p'^>  - 

p-p  ~ ™ ^ ^ _ 

T T -1  T 

<p'  d*  > <d'  d'  > <d*  p*  >,  (3) 

The  ith  diagonal  element  of  S represents  the  residual  variance 

th  ^ ^ 

of  the  estimate  of  the  i parameter  and  is  a direct  measure  of 

solution  quality.  If  p represents  a discretized  profile  of  some 

variable,  then  the  matrix  trace  of  S ^ is  a useful  measure  of 

P-P 

overall  solution  quality.  Here  p usually  represents  vertical 
temperature  or  humidity  profiles  at  a sufficiently  dense  set  of 
height  coordinates,  and  d is  some  function  of  observed  brightness 
temperatures . 

The  application  of  these  equations  to  ground-based  (g.b.) 
sensing  of  meteorological  profiles  differs  in  several  respects 
from  the  corresponding  satellite  retrieval  problem: 

(a)  Satellite  retrievals  use  a priori  statistics  appropriate 
to  a latitudinal  region;  a ground-based  application  requires  only 
single-station  statistics.  Consequently,  the  a priori  variance  of 
the  g.b.  ensemble  is  usually  much  less  than  that  used  for  satel- 
lites. 

(b)  For  the  g.b.  problem,  direct  observations  of  the  desired 
profile  at  the  surface  can  usually  be  obtained.  This  can  be 
imposed  as  an  exact  constraint  on  the  inferred  profile,  which,  in 
addition,  further  reduces  the  a priori  variance  at  all  levels, 
since  we  are  now  averaging  over  an  ensemble  with  fixed  surface 
conditions. 

(c)  A practical  consequence  of  (a)  and  (b)  is  that  linear 
methods  are  frequently  appropriate,  because  of  the  relatively 
small  variation  about  the  initial  guess  profile. 
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A.  Temperature  Sensing  by  Inversion  of  Single-Channel  Angular- 
Scanned  Radiometric  Data 

Initially,  ground-based  temperature  sensing  was  attempted  by 
fixed-frequency  angular  scanning  methods  (Refs.  1,20,  and  21)  . A 
typical  set  of  weighting  functions  is  shown  in  Fig.  2.  Although 
these  weighting  functions  attain  their  maxima  at  the  surface, 
linear  combinations  of  them  can  be  made  to  peak  at  various  alti- 
tudes yielding  a spatial  resolution  that  degrades  with  altitude 
(Ref.  3). 

Weighting  functions  give  the  system  response  to  a delta- 

function  input;  another  meaningful  system  characterization  is  the 

total  variance  of  the  obseirvations,  and  its  partitioning  into  the 

contributions  from  all  relevant  meteorological  variables.  An 

example  of  this  analysis  is  shown  in  Fig.  3,  in  which  partial 

variances  in  T,  from  fluctuations  in  temperature,  relative  h\imidity, 
b 

and  pressure  are  shown  as  a function  of  elevation  angle.  Although 

this  channel  is  in  the  O band,  the  fluctuations  due  to  humidity 

o 

exceed  those  from  temperature  for  angles  greater  than  20  . 

The  reduction  in  "noise”  due  to  surface  constraints  is  shown  in 

Fig.  4,  where,  in  particular,  the  variance  in  T_  due  to  pressure 

b 

fluctuations  is  reduced  to  almost  zero.  However,  the  contamination 
of  the  temperature  "signal”  due  to  the  humidity  "noise"  is  still 
extensive  and  really  requires  a water  channel  for  its  removal. 

As  mentioned  earlier,  the  diagonal  elements  of  ^ are  a 
measure  of  solution  quality.  Plots  of  the  square  roots  of  these 
diagonal  elements  as  a function  of  altitude  indicate  the  expected 
standard  deviation  of  the  solution  over  an  ensemble  of  profiles. 

An  example  of  this  type  offplot  is  Fig.  5,  which  shows  theoretical 
retrieval  accuracy  for  several  choices  of  operating  frequency  for 
the  August  climatology  of  Salt  Lake  City,  Utah.  These  calculations 
predict  an  accuracy  of  somewhat  better  than  1 K up  to  3 km  in 
altitude.  Retrieval  of  profiles  from  radiometric  data  (clear 
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Temperature  Weighting  Function  Relative  Units 


Fig.  2.  Temperature  weighting  functions  for  ground-based 
angular  scanning  at  54,5  GHz, 


Fig,  3,  Contribution  to  fluctuations  in  brightness  tempera 
ture  at  52,5  GHz  from  temperature,  humidity,  and  pressure. 


Fig.  4.  Contribution  to  fluctuations  in  brightness  tempera- 
ture at  52.5  GHz  from  temperature,  humidity,  and  pressure. 
Constrained  surface  conditions. 


skies  only)  gave  accuracies  very  close  to  these  predictions  (Ref. 
21) . Examples  of  profile  recoveries  from  single-channel  angular 
scan  data  are  given  in  Figs.  6 and  7. 

B.  Temperature  Sensing  by  Inversion  of  Multi-Spectral  Angular- 
Scanned  Radiometric  Data 

Although  the  simplicity  of  a single-frequency  angular- 
scanning  radiometer  is  attractive , scanning  with  a multi-channel 
system  can  significantly  improve  retrieval  accuracy  (Ref.  22).  An 
example  of  the  amount  of  improvement  can  be  seen  in  Fig.  8 in  which 
theoretical  retrieval  accuracy  for  several  systems  is  shown  as  a 
function  of  altitude.  Table  5 gives  an  explanation  of  the  spec- 
tral combinations  of  Fig.  8;  combination  lA  refers  to  experi- 
mental noise  level  of  0.1  K and  represents  somewhat  of  a practical 
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Temperature  Standard  Deviation^  K 


Fig.  5.  Theoretical  accuracy  in  retrieving  vertical  temper- 
ature profiles  by  inversion  of  single- frequency  angular  scanned 
radiometer  data.  = assumed  instrument  noise  level, 

a priori  standard  deviation  for  fixed  surface  conditions. 


Fig.  6.  Temperature  profile  derived  from  single- frequency 
angular  scanned  radiometer  data  at  Cincinnati,  Ohio.  A priori 
statistics  from  Dayton,  Ohio.  T - radiosonde  profile.  T = mean 
profile  for  constrained  surface  conditions.  T = inferred  profile. 
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Fig.  7.  Temperature  profile  derived  from  single- frequency 
angular  scanned  radiometer  data  at  Raleigh-Durham,  North  Carolina. 

A priori  statistics  from  Greensborough,  North  Carolina  - radiosonde 
profile.  T = inferred  profile. 


TABLE  5 


Explanation  of  Spectral  Combinations 
Employed  to  Estimate  Temperature 
Profiles 


Combination 

Elevation  angle 
(deg) 

Input  data 

Frequency 

(GHz) 

1 

5 

54.5, 

55.5 

10 

54.5, 

55.5 

15 

54.5, 

55.5 

30 

53.5, 

54.5, 

55.5 

60 

52.5, 

53.5, 

54.5 

90 

52.5, 

53.5, 

54.5 

2 

90 

52.5, 

53.5, 

54.5,  55.5 
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Fig.  5.  Theoretical  accuracy  in  retrieving  vertical  temper'^ 
ature  profiles  from  various  combinations  of  multi- spectral  multi- 
angle radiometer  data,  Denver,  Colorado  (after  Ref*  20) . 
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limit  in  measurement  accuracy;  the  noise  levels  in  other  combina- 
tions were  obtained  from  comparison  of  measurements  and  calcula- 
tions of  brightness  temperatxire . The  multi-spectral  angular-scan 
combination  1 improves  the  retrieval  accuracy  over  the  single- 
frequency scans  by  a margin  of  about  two  to  one . 

Typical  excimples  of  the  low-altitude  temperature  structure 
that  can  be  recovered  is  shown  in  Figs.  9,  10,  and  11,  which  show 
retrievals  of  a low  altitude  elevated  inversion,  a ground-based 
inversion,  and  a low-level  super-adiabatic  profile.  Note  expe- 
cially  the  improvement  in  the  recovery  of  the  elevated  inversion 
over  the  single-frequency  result  of  Fig.  7. 

IV.  RECENT  RESULTS  IN  INVERSION  OF  GROUND-BASED  MULTI -SPECTRAL 
RADIOMETRIC  DATA  TO  INFER  TEMPERATURE 
AND  HUMIDITY  PROFILES 

We  are  currently  investigating  the  feasibility  of  sensing  of 
temperature  profiles  from  an  ocean  data  buoy-mounted  radiometer. 
Because  of  buoy  motion,  the  previously  discussed  angular  scanning 
techniques  are  not  practical;  in  addition,  the  cloud  problem  dic- 
tates the  need  for  two  moisture  channels  to  sense  and  to  correct 
for  clouds.  Cloud  correction  algorithms  appropriate  for  microwave 
passing  sensing  have  been  published  by  Westwater,  et  al., 
Rosenkranz,  et  al.,  and  Fowler,  et  al.  (Refs.  18,  23,  and  24). 

The  National  Weather  Service  desires  that  the  temperature  of 
layers,  100  mb  in  thickness,  be  determined  with  accuracy  ± 1 K. 
Although  temperatures  of  the  lowest  two  layers  could  be  useful, 
retrievals  to  500  mb  are  wanted.  Predictions  for  ocean  clima- 
tologies, using  Eq.  (3) , showed  that  under  clear  conditions,  a 
three- frequency  zenith  looking  radiometer  could  meet  the  lower 
altitude  requirements  and  could  be  reasonably  close  to  those  at 
the  higher  altitudes  (Ref.  18)  . Accuracy  predictions  are  shown 
in  Fig.  12  for  several  systems  under  consideration;  the  tempera- 
ture weighting  functions  for  system  C are  shown  in  Fig.  13. 
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Fig.  9.  Inferred  and  radiosonde  temperature  profiles  from 
multispectral  multi-angle  radiometer  data  for  0400,  April  28, 

1971.  A priori  statistics  from  Denver,  Colorado.  (After  Ref.  20.) 


BOULDER,  COLO. 


Fig.  10.  Inferred  and  radiosonde  temperature  profiles  from 
multi-spectral  multi-angle  radiometer  data  for  0900,  April  28, 
1971. 
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Fig.  11.  Inferred  and  radiosonde  temperature  profiles  from 
multi-spectral  multi-angle  radiometer  data  for  1500,  April  30,  1971. 


Standard  Deviation,  K. 


Fig.  12.  Comparison  of  several  radiometric  systems  in 
retrieving  temperature  layers  of  100-mb  thickness  during  clear 
conditions  at  ocean  climatologies.  Five-station  rms  average. 
Instrumental  error  = 0.5K.  Systems:  A (52. 8,  55.4,  58.8,  20.6, 

31.65  GHz);  B(54.0',  55.4,  58.8,  20.6,  31.65  GHz);  C(52.8,  54.0, 
55.4,  20.6,  31.65  GHz);  D(55.4,  58.8  GHz). 
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Temperature  Weighting  Function  Relative  Units 

.1 

Fig.  13-  Temperature  weighting  functions  for  ground-based 
sensing  by  SCAMS  radiometer. 


As  mentioned  in  Section  II. D,  both  the  52.85  GHz  and  the 
53.85  GHz  channels  require  cloud  correction.  The  results  shovm 
in  Fig.  14  indicate  that  with  the  addition  of  the  two  water  cor- 
recting channels,  and  with  a careful  choice  of  frequency  in  the 
band,  retrieval  accuracies  approaching  those  in  clear  air  can  be 
obtained.  On  the  basis  of  the  calculations  shown  in  Figs.  12  and 
14,  the  radiometric  system  C (SCAMS)  was  chosen  for  experimental 
verification  of  the  temperature  sensing  capability  of  the  buoy- 
based  system. 

The  experimental  program  to  confirm  the  theoretical  predic- 
tions is  being  conducted  jointly  by  National  Oceanic  and  Atmospheric 
Administration  (NOAA)  and  the  Jet  Propulsion  Laboratory.  During 
a three-week  measurement  period  in  March  1976,  at  Point  Mugu, 
California,  21  suitable  radiometer -radio  sonde  obseirvations  were 
obtained.  Although  some  of  the  data  were  taken  when  there  were 
visual  observations  of  clouds,  the  radiometrically  estimated  liquid 
content  was  less  than  0.02  mm;  hence,  the  clear  air  retrieval 
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Standard  Deviation^  X 


Fig,  14.  Comparison  of  several  radiometric  systems  in 
retrieving  temperature  layers  of  100-mb  thickness  during  cloudy 
conditions  (liquid  thickness  < 4 mm  . Five-station  rms  aver- 

age. Instrumental  error  - 0.5  K.  Systems:  A(S2.8f  55.4,  58.8, 
20.6,  31.65  GHz);  B(54.0,  55.4,  58.8,  20.6,  31.65  GHz);  C(52.8, 
54.0,  55.4,  20.6,  31.65  GHz);  D(55.4,  58.8  GHz). 


algorithms  were  used  exclusively.  The  a priori  data  base  was  two 
years  of  twice’  daily  soundings  during  February,  March  and  April, 
taken  in  1973-1974.  The  statistical  summary  of  experiment  results 
are  compared  with  a priori  predictions  in  Fig.  15.  Although  the 
observed  variability  of  the  21  profiles  about  the  three-month 
a priori  is  somewhat  greater  than  the  theoretical  average,  the 
achieved  retrieval  accuracies  are  in  close  agreement  with  pre- 
dictions. It  is  somewhat  unusual  that  theory  predicts  (a)  no 
reduction  in  variance  at  about  750  mb  above  the  surface,  and 
(b)  a modest  reduction  in  variance  above  this  level.  The  experi- 
ment results  confirm  this  prediction. 

Low  resolution  information  on  the  water  vapor  distribution  is 
also  contained  in  the  five-channel  measurements.  Typical  water 
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Fig,  15.  RMS  accuracy  in  retrieval  of  100-mb  layer  averaged 
temperature r Pt.  Mugu,  California^  March  1976f  21  profiles . 

vapor  weighting  functions  for  the  SCAMS  system  are  shown  in 
Fig.  16.  Statistical  retrieval  algorithms  were  again  applied  to 
the  21  sets  of  radiometer  observations  to  infer  water  vapor  pro- 
files. The  sTjmmary  of  results  is  shown  in  Fig.  17,  and  typical 
temperature  and  humidity  retrievals  are  given  in  Figs.  18  and  19. 

Another  set  of  observations  was  taken  by  JPL  at  Pt-  Mugu  in 
July  1976.  In  all,  22  concurrent  radiometric  and  radiosonde 
observations  were  obtained.  The  statistical  retrieval  algorithm, 
in  Eg.  (2) , was  used  to  estimate  the  cloud  liquid  water  content 
(LWC)  of  each  of  the  profiles.  As  mentioned  earlier,  we  classify 
a profile  as  cloudy  if  the  estimated  LWC  is  greater  than  0.02  mm 
(a  LWC  of  0.02  mm  will  give  a 1 K change  in  brightness  temperature 
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Water  Density  Weighting  Function^ 
K/G/M3/km 


Fig.  16.  Water  vapor  density  weighting  functions  for  ground- 
based  SCAMS  radiometer. 

POINT  RETRIEVALS  OF  ABSOLUTE  HUMIDITY 


Fig.  17.  Ems'  accuracy  in  retrievals  of  absolute  humidity , 
Pt.  Mugu,  California,  March  1976,  21  profiles. 
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data  (dashed  lines)  compared  with  concurrent  radiosonde  profiles 


in  the  window  channel) . With  this  classification,  17  of  the  22 
profiles  were  radiometrically  cloudy.  We  are  currently  investi- 
gating the  application  of  various  cloud  correction  algorithms  to 
these  data.  However,  with  a very  simple  correction  method, 
namely,  no  correction  at  all,  the  upper  two  channels  can  yield 
useful  retrieval.  Examples  of  retrievals  obtained  during  cloudy 
conditions  are  shown  in  Figs.  20  and  21,  and  the  statistical  com- 
parison of  all  22  profiles  is  given  in  Fig.  22.  Although  there  is 
a substantial  reduction  in  variance  in  the  region  from  50  to  150  mb 
above  the  surface,  it  is  likely  that  cloud  correction  can  reduce 
this  variance  still  further. 


Fig.  20.  Example  of  temperature  profile  retrieved  from  two 
oxygen  channels  during  cloudy  conditions. 
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V.  CONCLUSIONS 


Statistical  retrieval  algorithms  have  proven  to  be  quite  use- 
ful in  ground-based  radiometric  sensing,  not  only  in  profile 
recovery,  but  in  making  a reasonable  a priori  prediction  of 
measurement  achievability.  Thus,  we  feel  we  can  confidently  answer 
questions  relevant  to  system  design,  such  as 

a.  what  is  the  climatological  variation  of  retrieval  accuracy? 
b-  for  a given  number  of  channels,  what  is  the  optimum  loca- 
tion of  measurement  ordinates? 

c.  what  noise  levels  are  required  to  give  specified  retrieval 
accuracies? 

We  are  currently  investigating  extensions  of  the  statistical  tech- 
nique to  the  microwave  cloud  problem. 
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SYMBOLS 

generalized  data  vector  (Eqs.  (2)  and  (3)) 
extinction  coefficient  of  rain^  km”^  (Eq.  (i) ) 
generalized  parameter  vector  (Eqs.  (2)  and  (3)) 
statistical  estimator  of  p (Eqs.  (2)  and  (3)) 
atmospheric  pressure^  mb  (Figs.  1,  3 and  4) 
relative  humidity  (Figs.  3 and  4) 
rain  rate,  mm/hr  (Fig.  1) 

path  length,  km  from  receiver  to  emitting  volume 
(Eq.  (2)) 

covariance  matrix  of  random  vector  q (Eq.  (3)) 
absolute  temperature,  K (Eq.  (1),  Figs.  1,  3 to  7) 
average  absolute  temperature  (Fig.  6) 
brightness  temperature,  K (Eq.  (1)  , Figs.  3 cind  4) 
brightness  temperature,  X external  to  the  Earth's 
atmosphere  (Eq.  (1)) 

2 

variance  in  brightness  temperature,  K (Figs.  3 and  4) 
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V[Tj^:P] 


a 


V 


V 


P 

liquid 

P 

vapor 

a 

a 

e 


variance  in  brightness  temperature,  due  to  fluc- 

tuations in  atmospheric  profile  parameter  p (Figs.  3 
and  4) 

absorption  coefficient,  km  ^ (Eq.  (1),  Fig.  1) 

frequency,  iSHz  (Eq.  (1)). 

density  of  liquid  water,  g/m^  (Fig.  1) 

absolute  humidity,  g/m^  (Fig.  1) 

standard  deviation  (Fig.  5) 

standard  deviation  of  instrumental  noise,  K (Fig.  5) 
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DISCUSSION 


Susskind:  When  you  do  your  analysis  of  the  SCAMS  data,  dp  you 

subtract  or  do  you  find  any  bias  in  the  brightness  temperatures? 

I know  Dave  Staelin  mentioned  there  was  the  possibility  of  a 2^ 
bias  in  the  measur emen4:s . And  when  we  do  our  analysis  we  have  to 
subtract  something  from  the  brightness  temperatures. 

Westwater:  On  the  first  or  the  second  slide  that  I showed,  we 

did  have  a small  bias  in  the  upper  channel — a 54.5  channel  and 
this  was  a bias  of  about  1.3^.  The  biases  in  the  other  two  chan^ 
nels  were  essentially  entirely  below  the  noise  levels.  These  were 
in  the  order  of  about  2/10  of  a degree.  Now  with  resepct  to  that 
question,  we  also  did  retrieval  with  subtracting  a bias  and 
retrievals  with  just  the  raw  data.  And  for  the  temperature 
retrievals  it  made  very  little  difference  in  the  RMS  errors.  But 
there  is  somewhat  of  a bias  in  this  in  the  data. 

Susskind:  I wonder  if  I could  address  this  quickly  to  Dave  Staelin. 

You  mentioned  that  there  was  a bias  of  approximately  2^  with  the 
SCAMS  measurement.  Is  that  still  there?  Namely,  that  due  to 
instrument  calibration. 

Staelin:  There  are  two  origins  for  bias  in  the  satellite  instru- 

ments. One  is  the  instrximent  calibration.  We  recalibrated  in 
orbit  using  NMC  data  yielding  corrections  on  the  order  of  a degree 
or  so.  There  is  another  correction  in  the  case  of  our  data  from 
both  the  Nimbus  5 and  6 satellites,  which  I did  not  discuss,  and 
that  is  the  transmittances . Either  they  have  systematic  errors 
which  are  larger  than  anything  that  has  been  indicated  in  the 
laboratory  spectroscope  data,  to  date,  or  the  radiosondes  have 
systematic  errors  which  are  larger  than  their  manufacturer  would 
probably  like  to  accept.  This  is  still  a residual  effect  on  the 
order  of  one  or  two  degrees.  But  we  have  yet  to  track  down  its 
origin.  It  can  be  removed  empirically  and  is  not  a problem  now. 

Susskind:  We  do  find  roughly  a systematic  difference  between 

the  observations  and  our  ability  to  calculate  them  of  approximately 
2^.  It*s  hard  to  tell  if  it  is  in  the  measurement  or  in  our 
ability  to  calculate  them. 

Westwater:  There  is  one  other  parameter  in  the  Rosencrantz  theory 

that  is  quite  difficult  to  measure  by  Liebe's  laboratory  dis- 
persion technique.  This  does  not  affect  the  calculated  radiances 
at  the  strongly  attenuating  frequencies.  However,  at  a frequency 
of  around  50  GHz  or  52  GHz,  they  are  much  more  sensitive  to  this 
parameter  that  was  not  determined  well  from  Liebe's  measurements. 

In  our  calculations,  we  used  the  value  of  the  nonresonant  param- 
eter that  was  given  by  Rosencrantz  himself  and  it  seemed  to  give 
a much  better  agreement  with  our  measurements. 
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Fleming:  Would  you  care  to  comment  on  just  how  you  adjust  for 

clouds  when  you  sense  them? 

Westwater:  The  technique  that  we  originally  tried  was  to  estimate 

the  equivalent  clear  air  brightness  from  the  entire  set  of  radiance 
obse3Tvations  cind  then  use  the  equivalent  clear  brightness  in 
retrieval.  However,  we  ran  into  difficulties  in  applying  this 
technique  to  the  52.8  GHz  channel,  so  that  the  retrievals  you  saw 
were  uncorrected  brightness  measurements  at  53.8  and  55.45  GHz. 

We  are  still  trying  to  unscramble  the  eggs  again  and  what  has 
happened  to  the  cloud  correction  at  the  lower  attenuating  oxygen 
channel. 

Fraser:  Perhaps  I misunderstood  your  last  slide.  I understood 

you  to  say  that  you  were  quite  pleased  with  the  water  vapor 
retrieval.  But,  as  I read  the  last  slide,  the  theoretical  and 
the  experimental  standard  deviations,  which  I interpret  as  devia- 
tions in  the  radiosondes  observations,  except  for  50  to  70  milli- 
bars above  the  ground,  were  essentially  the  same.  If  they  are, 
then  why  are  you  so  pleased  with  experimental  data? 

Westwater:  The  slide  I showed  at  the  end  was  the  clear  air  data 

that  was  taken  in  March.  And  at  least  the  total  integrated  water 
contents  agreed  quite  well  with  that  estimated  by  the  radiosondes. 
However,  in  the  period  in  July  we  did  not  obtain  adequate  agree- 
ment, primarily  because  of  the  cloud  correction  difficulties  that 
I mentioned  before.  One  of  the  reasons  I am  really  pleased  with 
the  data  is  we  really  expected  to  get  one  parameter  and  one  param- 
eter alone.  That  would  just  be  the  total  integrated  water  con- 
tent. The  amount  of  structure  shown  in  the  retrievals  was  an 
order  of  magnitude  greater  than  anything  I would  have  expected  or 
that  anyone  would  have  expected  from  a weighting  function  which  is 
essentially  constant  with  altitude.  There  is  very  little  structure 
in  the  weighting  function  itself. 
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INVERSION  METHODS  IN  TEMPERATURE  AND  AEROSOL 


REMOTE  SOUNDING:  THEIR  COMMONALITY  AND 
DIFFERENCES,  AND  SOME  UNEXPLORED 
APPROACHES 


Alain  L.  Fymat 
Jet  Propulsion  Laboratory 
California  Institute  of  Technology 


Departing  from  conventional  research  lines,  this  paper 
considers  in  parallel  the  two  remote  sensing  problems  of 
temperature  profiling  and  aerosol  characterization  (complex 
refractive  index,  size  distribution)*  These  problems  are 
formally  identical  and  differ  only  in  the  explicit  form  of 
the  source  function  which,  for  aerosols , includes  contribu- 
tions from  both  single  and  multiple  scattering  processes. 

The  functional  dependence  on  the  desired  atmospheric  param- 
eters, however,  is  considerably  more  complex  in  the  case  of 
aerosols . Both  problems  are  essentially  nonlinear.  However, 
when  the  observables  are  the  spectral  extinction  or  the 
single  scattering  of  the  source  radiation,  the  associated 
problem  is  completely  analogous  to  the  linearized  tempera- 
ture inversion  problem;  viz.  the  solution  of  a first-kind 
Fredholm  integral  equation  must  be  obtained.  Four  questions 
attach  to  such  an  equation:  existence , uniqueness,  stability 
and  construction  of  the  solution,  which  are  all  analyzed. 
Methods  for  obtaining  the  solution  of  the  linear  problem  are 
classified  following  three  main  categories  (i)  derivation  of 
properties  that  all  solutions  satisfy,  which  must  then  be 
properties  of  the  actual  solution;  (ii)  regularization  of 
the  ill-posed  problem;  and  (Hi)  data  changes  within  their 
domain  of  uncertainty  in  order  to  avoid  the  basic  instability. 
A number  of  unexplored  methods  (e.g.,  reduction  to  a second- 
kind  equation,  singular  value  and  other  decompositions, 
invariant  imbedding,  etc.)  are  indicated.  Solutions  of  the 
nonlinear  problem  are  also  classified.  Lastly,  a two-step 
strategy  is  proposed  for  retrieving  first  the  complex  refrac- 
tive index  of  aerosols,  in  a manner  that  is  essentially 
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independent  of  their  size  distributionr  and  then  the  size 
distribution  itself*  The  first  step  involves  the  inversion 
of  spectral  extinction  ratios  using  the  author’s  Minimization 
Search  Method^  while  the  second  step  rests  on  an  analytical 
integral  transform  of  light  scattered  within  a narrow  for- 
ward cone* 

I . INTRODUCTION 

Vertical  temperature  profiling  and  aerosol  microstzncture 
reconstruction  have  generally  formed  two  separate  branches  of 
research  in  remote  sensing  theory-  This  separation  also  exists  in 
the  retrievals  of  the  constant  concentration  of  thermally  active 
gases  and  the  complex  refractive  index  of  aerosols-  While  in  each 
case  of  these  two  situations,  the  underlying  physical  processes 
are  certainly  different,  the  corresponding  mathematical  inversion 
problems  are  nevertheless  very  similar-  Thus,  a greater  cross- 
fertilization between  these  two  traditionally  divided  activities 
can  only  result  in  advances  in  the  solutions  to  these  problems. 

In  fact,  one  should  proceed  further  and  even  explore  whether  the 
physical  basis  in  one  area  can  be  used  advantageously  in  the  other 
area.  For  example,  measurement  of  the  in-  and  out-of-band  com- 
ponents of  Rayleigh  scattering  by  an  atmospheric  absorber  of  known 
density  such  as  O^,  can  provide  the  altitude  distribution  of 

molecular  absorption  from 'which  the  temperature  profile  can  sub- 
sequently be  recovered.  Likewise,  Mie  scattering  by  aerosol  strati- 
fications can  locate  the  altitudes  of  temperature  inversions 
trapping  these  particulates-  Information  derived  from  these  and 
similar  scattering  techniques  can  be  used  to  improve  temperature 
retrievals  in  the  same  way  as  temperature  and  gaseous  composition 
determinations  can  help  in  the  investigation  of  aerosols. 

The  aim  of  this  article  is  to  set  forth  the  commonality  and 
differences  between  the  methodologies  that  have  been  developed,  or 
that  can  be  employed,  in  the  remote  sounding  of  temperature  and 
aerosols.  The  following  section  provides  the  equations  of  remote 
sens.'ng,  appropriate  for  a variety  of  viewing  geometries,  in  both 
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their  differential  and  integral  formulations,  and  expressions  for 
the  internal  and  external  field  sources-  Section  III  formulates  the 
temperature  sounding  problem  in  the  infrared  and  microwave  regions 
of  the  spectrim  with  analyses  of  the  weighting  functions  and  of  the 
nature  of  the  associated  mathematical  problem.  Section  IV  considers 
the  particulate  sounding  problem  for  reflected,  directly  and  dif- 
fusely transmitted  radiation  at  low  and  large  scattering  angles, 
and  provides  a tabular  analysis  of  the  nature  of  the  associated 
inverse  problems.  Section  V analyzes  the  nature  of  the  problem  of 
finding  a solution  to  linear  first-kind  Fredholm  integral  equations 
in  terms  of  existence,  uniqueness,  stability,  and  construction  of 
the  solution.  It  also  provides  a preliminary  classification  of 
methods  of  solution  of  such  equations.  The  corresponding  classifi- 
cation for  nonlinear  methods  is  given  in  Section  VI.  The  final 
section  proposes  a two-step  strategy  developed  by  the  author  for 
solving  the  particulate  sounding  problem.  There,  the  complex  refrac- 
tive index  would  be  retrieved  from  spectral  extinction  ratios  while 
the  particle  size  distribution  would  be  recons tjruc ted  from  angular 
forwardly  scattered  radiances  at  a given  wavelength. 

II.  EQUATIONS  OF  REMOTE  SENSING 

Consider  the  situation  represented  by  Fig.  1:  a slab  atmos- 
phere bounded  between  altitudes  z^  and  z^f  or  optical  depths, 
and  0,  respectively.  Any  point  P in  the  atmosphere  is  located  in 
a cartesian  frame  of  reference  by  its  spherical  coordinates:  r = 
radius  vector,  0 = zenith  angle  measured  from  the  z-axis,  and 
cf)  = azimuth  angle  measured  from  the  x-axis.  Upward  directions  are 

I 

denoted  Q = (a,  0,  y) ? likewise  downward  directions  are  represented 
by  E (a,  3,  -y)  , where  a = sin  0 cos  (j),  3 = sin  0 sin  (j)  and 
y = cos  0 are  direction  cosines.  The  equations  of  remote  sensing 
are  derived  from  the  radiative  transfer  equation  (RTE)  expressing 
the  principle  of  energy  conservation  within  an  elemental  atmospheric 
volimie.  They  can  be  obtained  in  either  a differential  or  an  inte- 
gral form,  as  follows. 
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Fig.  1.  Illustrating  the  geometry  for  remote  sensing. 

A.  Differential  Formulation 

In  its  most  general  form,  the  monochromatic  RTE  at  frequency 
V for  either  upwelling  or  downwelling  radiation  is  written  as 

(sr  • V)  I*  - -K^(i;  - j^)  (1) 

where  the  specific  intensity  (or  radiance)  I,  the  sour ce-fxinct ion 

J,  and  the  volume  extinction  coefficient  K are  all  functions  of 

->■  

location  and  direction:  F = F (r;  F = I,  J,  K.  The  differen- 

tial  operator  (Q  • V)  is  specified  by  the  geometry  of  the  atmos- 
phere; various  relevant  expressions  of  this  operator  are  summarized 
in  Table  1.  It  is  usually  assumed  that  K is  either  directionally 
averaged  or  independent  of  direction.  The  same  assximption  is 
separately  valid  for  the  absorption  coefficient,  k^,  and  the 
scattering  coefficient,  a^,  the  sum  of  which  equals  K. 


432 


TABLE  1 


Various  Forms  of  th^  Differential  Transfer 
Operator  for  Geometries  of 

Interest  to  Remote  Sensing 


ATMOSPHERIC  GEOMETRY 


la.  SPHERICAL 
(TWILIGHT, 
LIMB. 

TERMINATOR) 


lb.  (ORIGIN  AT 
EARTH'S 
CENTER  AND 
Z PARALLEL 
TO  LOCAL 
VERTICAL) 


DIFFERENTIAL  TRANSFER  OPERATOR 

“It  ^ ^ 

- (l-/i^)^^^  |^COS0  ^ + Sin0  ± JU-  COS0 


± 


ar 


IimLa 

r 


(1 V) 


1/2 


(I'/^O^) 


1/2 


cos  0 


a 

9^0 


1-/^ 


2.  SPHERICAL 
SYMMETRY 

(a/a^o  = a/d0  3 o) 


3.  AXIAL  SYMMETRY 
(M  - 1) 


± 


ar 


± 

9/i 


+ 


d_ 

ar 


4. 


PLANE-PARALLEL 

(e. g. , a/ax  = a/ay  = o) 


* 


Equation  (1)  is  to  be  solved  subject  to  appropriate  boundary 
conditions  on  the  radiation  field  at  the  top  I (z  = or  t = 0) 
and  the  bottom  I^(z  = or  t = x^)  of  the  atmospheric  slab. 
Depending  on  the  wavelength  region  utilized  and  on  assumptions  on 
the  problem,  Eq.  (1)  can  either  be  a scalar  or  a four-vector 
equation.  The  latter  situation  involves  Stokes*  representation 
of  a light  vector^  but  it  may  be  noted  that  other  representations, 
such  as  a four-coherency  vector  or  a 2 x 2 Jones*  matrix, can  be 
used  as  well.  In  general,  k,  o and  K are  taken  to  be  scalars.  In 
the  following  sections,  we  shall  limit  ourselves  to  the  case  of  a 
plane-parallel  stratified  atmosphere  for  which  the  RTE  is  simply 
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± M I~(z;  y,  (p)  = -K(z)[l"(z;  y,  (j>)  - J(z;  y,  (j))  ] (la) 

oZ 

Alternatively,  introducing  the  normal  optical  thickness - 


T 

V 


(z) 


K (z)dz 
v' 

•'z 


we  also  have 

± y y,  <j))  = J(x;  y,  <|)) 


+ 

I (t;  y,  <{)) 


(lb) 


(2) 


(In  these  equations  and  in  the  following,  the  subscript  v is 
dropped  for  convenience.  It  will,  however,  be  indicated  when  it 
occurs  for  the  first  time.) 


B . Integral  Formulation 

The  integral  formulation  is  simply  obtained  by  formal  inte- 
gration of  the  RTE.  For  example,  by  integration  of  Eq.  (2)^ 


I (x;fi)  = I (Xj^;fi)e  ^ 


-(s  - T)/y 

J(s;  Q)e  ^ (3a) 


— — T / li 

I (x;  J^)  = I (0;  f2)e  ^ + J(s;  J^)e 


-(x  - s)/y 


ds 

y 


(3b) 


which  provide  the  upward  and  downward  radiation  fields  at  any  arbi- 
trary level  T in  the  atmosphere.  The  first  term  on  the  right  side 
of  either  of  these  last  equations  represents  the  boundary  con- 
tribution while  the  integral  term  provides  the  atmospheric  contri^ 

bution  proper.  From  these  equations,  one  obtains  immediately  the 

+ 

field  emerging  from  the  top,  I (0;  , or  reaching  the  bottom, 

“ ->■ 

I bhe  atmosphere.  In  particular,  for  an  infinitely 

thick  medium  (x^  = °") , 

— s/y 

l’*’(0;  n)  = J(s;  fi)e  — (3c) 

■'O  ^ 
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C.  Internal  and  External  Field  Sources 


The  explicit  solution  of  the  RTE,  in  either  its  integral  or 
its  differential  form,  depends,  of  course,  on  assumptions  on  the 
nature  of  the  source  fields  either  impinging  on  the  atmosphere  or 
originating  within  the  atmosphere  itself.  Expressions  for  the 
source  function  are  siommarized  in  Table  2.  In  particular,  the 
scattering  source  function  for  a clear  atmosphere  is 

*^NC,S  ^ 

= -r  V (r;  )I  )t  (r,  r ; 

4 V o o o V o 


+ ;^  I P^(?;  fi')da>*  (4) 

where  I^  is  the  incident  radiation,  t the  transmission  function 
and  P the  scattering  phase  function  (or  phase  matrix  for  a light 
vector) . It  receives  contributions  from  both  single  (SS)  and 
multiple  (MS)  scattering  processes  in  the  atmosphere  (first  and 
second  term,  respectively,  in  the  right-hand  side  of  the  equation) • 
The  integral  equation  satisfied  by  g,  the  so-called  aiaxiliary 

equation  of  radiative  transfer  for  the  source  function,  is  obtained 
by  substituting  Eqs.  (3a)  and  (3b)  in  the  second  term  in  the  right- 
hand  side  of  Eq.  (4) . The  kernel  of  these  equations  is  a linear 
combination  of  the  scattering  phase  functions  (or  phase  matrices) 
for  gaseous  (superscript  g)  and  particulate  (superscript  p)  con- 
tributions . 


P = a P^  + (1  - a )P^ 
V V V V V 


where  the  weight,  called  turbidity  factor,  is  given  by 


(5) 


a = 


0^  + 0^ 
V V 


(6) 


For  some  gases,  it  may  be  necessary  to  take  into  account  their 
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TABLE  2 


Source-Function  Expressions  Corresponding 
to  Some  Typical  Atmospheric  States* 


ATMOSPHERIC  STATE 

SOURCE-FUNCTION  EXPRESSION 

1.  LOCAL  THERMODYNAMIC 

EQUILIBRIUM: 
SCATTERING  NEGLECTED 

■'nC.NS  ■ 

2.  LOCAL  THERMODYNAMIC 

EQUILIBRIUM: 
SCATTERING  INCLUDED 

■’nc  ' ■’nc.s  ■'nc.ns 

3.  PARTLY  OBSCURED  BY 

HAZE  OR  CLOUD 

J*"®  = N J°  + (1-N)  UN  INFRARED) 

4.  ANY  ARBITRARY 

COMBINATION  OF 
PREVIOUS  CASES 

LINEAR  COMBINATION  OF  PREVIOUS 
SOURCE  FUNCTION  EXPRESSIONS 

•NC  - NOT  CLOUDY.  NS  » NOT  SCATTERING,  NO  ■ NOT  OBSCURED 
S ■ SCATTERING,  0 ■ OBSCURED,  PO  ■ PARTLY  OBSCURED 

cJv  ■ • SINGLE  SCATTERING  ALBEDO,  • o9  +aP,  - Kg  + Kg 

N - FRACTION  OF  CLOUDINESS  OF  The  SKY,  T 'TEMPERATURE,  P ■ PRESSURE 

nonsphericity;  in  this  case,  the  phase  function  is  that  corre- 
sponding to  Rayleigh-Cabannes  scattering 

= b + (1  - b )P^  (7) 

V V V V V V 

itself  a linear  combination  of  the  Rayleigh  phase  function  (super- 
script R) , applicable  to  scattering  by  spherical  molecules,  and 
the  isotropic  (superscript  I)  phase  function.  For  continuum  fre- 
quencies, V = f,  the  weight  = 2(1  - 6)/(2  + 6),  where  6 is  the 
so-called  gaseous  depolarization  factor  (e.g. , b^  = 0 for  isotropic 
scatterers;  b^  = 1.0  for  Rayleigh  (spherical)  particles  or  mole- 
cules; b^  = 0.4  for  rod-like  scatterers;  and  0.4  < b^  < 1.0  for  all 
shapes  between  a rod  and  a sphere)  while  for  spectral  line  fre- 
quencies, V = £,  it  is  the  well-known  Hamilton's  coefficient  for 
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resonantly  fluorescent  line  scattering  (e.g.,  = 0.5  for  H-Ly a ) - 

If  the  particulates  are  spherical,  is  provided  by  the  Mie 
expressions.  If  denote  the  elements  of  matrix  P,  i,  j = 1 to 

4,  we  have  the  following  normalization  condition 


JL 

4it 


Pll(r; 


Q' 


f2)doi  = 1 


(8) 


(P^^  is  none  other  than  the  phase  function.)  The  phase  function- 
(or  phase  matrix)  also  satisfies  a number  of  reciprocity  and  sym- 
metry relations  (not  given  here) . 


The  equations  summarized  so  far  provide  the  necessary  formal- 
ism for  remote  sensing  from  above  the  atmosphere  or  from  the  sur- 
face and  for  looking  at  arbitrary  targets  along  arbitrary  directions. 
They  include  (or  can  be  used  to  provide)  equations  appropriate  for 
nadir  or  zenith  viewing,  twilight,  limb,  and  terminator  scans. 


III.  TEMPERATURE  SOUNDING  PROBLEM 

The  basic  assumptions  are  here:  plane-parallel  atmosphere, 
in  Local  Thermodynamic  Equilibrium  (LTE) , free  of  clouds  and 
aerosols;  negligible  gaseous  scattering;  negligible  surface  reflec- 
tion of  downward  a-fmospheric  radiation;  and  known  active  gases. 

The  following  paragraphs  provide  the  relevant  equations  for  the 
thermal  and  the  microwave  sounding  of  the  temperature  structure. 

For  simplicity,  and  with  the  exception  of  some  remarks  on  the  limb 
geometry,  we  shall  essentially  limit  these  derivations  to  nadir 
and  zenith  viewing  geometries. 


A.  Thermal  Sounding 

The  relevant  equations,  in  differential  form,  are  obtained 
simply  from  the  derivations  of  the  previous  section 


+ M 9I^(z;  u) 

k' (z)p (z)  8z 


B[T(Z)  ] 


I^(z;  y) 


(9) 
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where  k'(z)  = k(z)/p(z)  is  the  mass  absorption  coefficient,  and  p 
is  the  absorbing  gas  density.  It  is  easy  to  convert  this  equation 
from  an  altitude,  z,  to  a pressure  variable,  p,  by  use  of  the 
hydrostatic  equation 


- p(z)dz 


w(p) 

g(p) 


dp 


where  w is  the  mass  mixing  ratio  of  the  absorbing  gas,  and  g is  the 
acceleration  of  gravity.  Equation  (9)  must  be  solved  subject  to 
the  boundary  conditions: 

1^(0;  y)  = b[t(0)]  (Nadir  viewing)  j 

y (10) 

l”(z;  y)  = 0 (Zenith  viewing)  ( 

where  z is  the  altitude  of  the  atmospheric  top  (or  of  the  sensing 
platform).  Integrations  of  Eqs.  (9)  and  (10),  or  use  of  Eqs.  (3) 
and  (10) , yields  straightforwardly 

I^(y;  y)  =B[T(y  )]t[y,  y ; w (y) ; T(y) ; y] 
s s X/ 


b[t (y)  ]-^  t[y , y;  w^  (y)  ; T(y);  y]dy  (11a) 


I (y^;  y)'  = - 


B[T(y)  t[y,  y^;  w^(y);  T(y);  y]dy  (11b) 


where  y e p,  log  p,  ...  is  a single-valued  function  of  the  pres- 
sure / and 


t(y^,  y^;  y)  = exp 

y . 

(12) 


is  the  slant  transmission  function  between  levels  and  y^;  its 
derivative  is  now  commonly  referred  to  as  the  weighting  function. 
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B.  Microwave  Sounding 


Equations  (9)  and  (11)  are  still  valid  in  this  case,  but 
Planck's  function  can  be  replaced  by  its  approximation  provided 
by  the  Rayleigh- Jeans  formula - 

C.  Properties  of  the  Weighting  Functions 

The  CO^  transmittances  and  weighting  functions  corresponding 
to  the  National  Oceanic  and  Atmospheric  Administration' s NOAA  2- 
Vertical  Temperature  Profile  Radiometer  are  illustrated  in  Fig.  2. 
The  following  general  properties  of  the  weighting  functions  can  be 
listed: 

1.  For  both  infrared  (IR)  and  microwave  (MW)  radiation,  they 
are  weak  functions  of  the  temperature,  mainly  through  line  broad- 
ening . 

2.  Each  function  covers  a finite,  although  extensive,  portion 
of  the  atmosphere  and  exhibits  a generally  broad  peak.  This 
results  in  low  vertical  resolution  of  the  retrieved  temperature 
profile,  i.e.,  the  fine  scale  structure  cannot  be  recovered-  It 
may  be  noted,  however,  that  the  weighting  functions  corresponding 
to  limb  observations  can  be  much  narrower,  thus  permitting  a finer 
resolution  than  with  other  geometries. 

3.  Some  functions  can  exhibit  several  maxima  which  result  in 
inversion  ambiguities.  To  avoid  this  situation,  it  is  best  to  dis- 
card such  functions - 

4.  The  functions  are  overlapping,  i.e.,  they  are  mutually 
correlated,  the  degree  of  correlation  increasing  with  the  number 
of  channels  (frequency  or  tangent  height) . Methods  for  retrieving 
the  temperature  profile  will  later  be  described.  In  those  methods 
where  the  integral  remote  sensing  equation  is  approximated  by  a 
linear  algebraic  system,  this  correlation  manifests  itself  by  the 
linear  dependence  (ill-conditioning)  of  the  colimin  vectors  of  the 
operator  matrix.  As  the  number  of  channels  is  increased,  so  does 
the  niomber  of  column  vectors  and  their  linear  dependence. 
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CO^  TRANSMIHANCES 


Fig.  2.  NOAA  2 - Vertical  Temperature  Profile  Radiometer 
(from  Ref.  1) . (Note:  Only  center  frequencies  are  listed.  Filter 
bandpasses  at  half-widths  are  provided  within  parentheses  in  the 
insert.)  (1  bar  - 100  kPa.) 


D.  Nature  of  the  Mathematical  Problem 

The  remote  sensing  equations,  Eqs.  (9)  or  (11),  are  essentially 
nonlinear  in  the  temperature  profile.  The  sources  of  nonlinearities 
are  summarized  as  follows: 

1.  Temperature  dependence  of  the  transmission  function:  weak 
in  all  wavelength  regions  of  interest. 

2.  Frequency  dependence  of  Planck's  function:  weak  within  a 
single  IR  band,  stronger  when  several  IR  bands  are  considered 
together;  none  in  the  MW  region. 

3.  Scattering  by  gases  and  particulates:  weak  (usually  not 
considered) . 

4.  Surface  reflection  effects:  weak  (usually  not  considered). 
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5.  Presence  of  clouds  and/or  hazes:  can  be  large. 

6.  Mathematical  nonlinearities  (constraints,  etc.):  can  be 
large . 


However,  when  utilizing  a single  IR  band  or  in  the  MW  region,  it 
is  possible  to  separate  the  frequency  and  temperature  dependences 
of  Planck’s  function  by  using  a variety  of  approximations.  In  this 
case,  the  integral  remote  sensing  equation  can  be  written  as  the 
first-kind  Fredholm  integral  equation: 


I(v  J 


K(v^;  y)B(y)dy,  (j  = 1,  2, 


n) 


(13) 


where  are  the  n sounding  frequencies,  the  kernel  K is  the  trans- 
mission function  (since  it  exhibits  a much  weaker  variation  with 
T than  does  B) , B stands  for  the  T-terin  in  Planck’s  function,  and 

j / N _ Measurement  (-  surface  term) 
j v-term  in  Planck’s  function 

Prior  to  reviewing  the  methods  of  solution  of  the  temperature 
sounding  problem,  we  shall  first  analyze  the  particulate  sounding 
problem  in  order  to  bring  into  more  evidence  the  commonality  and 
differences  between  these  two  problems. 


IV.  PARTICULATE  SOUNDING  PROBLEM 

The  basic  assumptions  for  this  problem  are:  plane-parallel 
atmosphere,  completely  hazy  or  cloudy;  known  contributions  from 
gases;  no  thermal  radiation  effects;  and  known  surface  reflection 
effects. 

A.  Reflected  and  Transmitted  Radiation 

The  form  of  the  remote  sensing  equations  used  in  this  case  is 

±y  i±(t;  n)  - j(t;  t)  (14) 

where 
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J = w[J(SS)  + J(MS) ] 


is  provided  by  Eq.  (4) , and  the  boundary  conditions  are 

fi)  = Prescribed  surface  . 

reflection  Reflection 

I (0;  = Radiation  source  ' 

s tr  eng th  Tran  smi s s i on 

The  corresponding  integral  formulations  are  as  given  in  Eqs* (3a) - (3c) . 

For  transmitted  radiation,  it  is  interesting  to  note  that  it 
is  possible  to  separate  the  direct  from  the  diffuse  component.  In 
the  former  case,  the  corresponding  spectral  extinction  measurements 
are  perfoimed  by  pointing  in  the  source  direction.  In  this  direc* 
tion,  direct  radiation  is  predominant,  and  the  extinction  process 
conserves  any  incident  polarization.  Thus,  while  these  measure- 
ments are  restricted  to  a single  direction  and  polarization  infor- 
mation is  lost,  nevertheless,  the  complexity  of  the  corresponding 
mathematical  problem  is  considerably  reduced.  The  equation  for 
this  case  is  obtained  by  letting  J = 0 since  there  is  no  self- 
illumination of  the  medium  by  scattering,  and 


I (t^;  = I (0;  n)e  (16a) 

a strictly  scalar  problem  irrespective  of  the  initial  polarization. 
Away  from  this  direction,  direct  radiation  vanishes,  and  Eq.  (3b) , 
or  Eq.  (14)  with  a null  boundary  condition,  yields 


l"'(T^;  h 


•T 

^ J(s; 
V 


Q)  e 


1 


- s)/y 


ds 


(16b) 


a vector  problem  if  polarization  is  considered - 

The  problem  unknowns  are,  in  the  macroscale:  the  volume 
absorption  and  scattering  coefficients  and  their  vertical  profiles 
(from  which  the  scattering,  absorption,  and  total  optical  depths 
at  any  level  in  the  atmosphere,  the  corresponding  optical  thick- 
nesses of  the  atmosphere,  and  the  single  scattering  albedo  and  its 
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vertical  profile  can  be  obtained) , and  the  scattering  phase-function 
(-  matrix) . If  the  absorption  and  scattering  coefficients  are 
known,  then  it  is  sufficient  to  determine  the  number  density  pro- 
file in  place  of  the  corresponding  volume  coefficients.  In  the 
microscale,  these  parameters  involve  the  unknown  complex  refrac- 
tive index,  the  particle  size  distribution,  and  their  vertical  pro- 
files. 


B. 


Spectral  Extinction 

Assuming  that  the  particulates  are  spheres,  Eq.  (16a)  becomes 

|l"(T^;  I 


E[(A)  ;m(A)  ; n(x)  ] = - 


- y iln 


I (0;  Z) 


= TTk 


-3 


ext 


[x; 


m^(X) 


(X) ]n (x) dx 


(16c) 


where  k = 27t/X  is  the  wavenumber,  x = 2Trr/X  is  the  size  parameter, 
r - radius,  m^  and  m^  are,  respectively,  the  real  and  imaginary 
components  of  the  refractive  index,  n(x)  is  the  size  distribution, 
and  is  the  Mie  efficiency  factor  for  extinction 


4 2 °° 

Q ^ = — Re  {S(0)}  = — y (2n  + l)Re  (a  + b ) (17) 

ext  2 2 ^ n n 

X X n = 1 

In  this  last  expression,  S(0)  is  the  Mie  complex  scattering 
amplitude  function  in  the  forward  scattering  direction,  and  a^  and 
b are  the  well-known  Mie  coefficients.  For  Im  - l|  <<  1, 
van  de  Hulst  has  provided  the  following  approximation  to 


Q = 4 Re{K(ip  + p tan  g) } 

rl 


(18a) 


where 


K(w) 


w 


-w 


e 


2 

w 


1 
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p = 2xlm  - l|  , and  tan  3 = - 1)  - The  approximation  in  Eq.  (18a) 

is  found  in  practice  to  apply  well  for  1.33  < < 1.5.  For 

1-0  < < 1.5  and  0.05  1 £ 0.25,  Deirmendjian  has  provided  the 

following  .ad  hoc  extension  of  Q : 

H 

a + D)Q^  (18b) 

where  D is  a known  constant. 

If  and  are  known,  or  constant  over  the  wavelength  range 
of  the  measurements,  or  else  slowly  varying  within  this  range, 

Eq-  (16c)  can  be  written  as  a linear  first-kind  Fredholm  integral 
equation  in  the  size  distribution: 


E(X^) 


* 

K(X.;  x)N(x)dx  (j  = 1,  2, 


n) 


(19) 


where  K = Q (or  x^Q  ) and  N = x^n(x)  [or  n(x)]. 


C.  Single  Scattering  in  a Homogeneous  Slab 

The  corresponding  vector  equations,  obtained  by  canceling 
J(MS)are*  respectively,  on  reflection  and  transmission: 

l'^(0;  y,  y.)  = P[0;  m;  n(x)]l"(0;  y.)  (20a) 

0 4 a.  0 

I (t^;  y,  y^)  = ^ P[0;  in;  n(x)]  I (0;  y^)  (20b) 

where  0 is  the  scattering  angle,  and 


f E 


g = 


^0  ^ 


yo  - 


- T, 


1 - e 


^oJ 


- e 


(20c) 
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Further,  for  x,  «1,  f = g=^T,/y,  S>t-,=  t:  ( = scattering  optical 

thickness) , and  Eqs.  (20a)  and  (20b)  degenerate  to  the  single 
expression 

T 

I (0;  y,  y^)  = I (0;  ^ ^0^ 

For  incident  light  that  is  either  natural  or  fully  plane -polarized, 
Eq-  (21)  can  be  reduced  to  a scalar  one.  If  the  phase  - function 
(-  matrix)  is  known,  or  modeled,  this  equation  can  be  used  for 
retrieving  Alternatively,  if  is  known  independently,  and 

if  m is  also  known,  the  same  equation  can  be  written  as  a first 
kind  Fredholm  integral  equation  in  the  size  distribution 

4yl'^(0;  y,  y ) r 

-r-  . = P(0)  = P(G;  x)n(x)dx  (22a) 

T I (0;  y _)  _ 

sc  0 ■'X 

where  P has  different  expressions  depending  on  the  incident  polar- 
ization. 

D.  Near-Forward  Scattering 

Let  be  the  length  of  the  scattering  medium.  Since  this 

medium  is  homogeneous,  t = a£.  By  introducing  the  scattering  dia- 

sc 

gram  F = aP,  Eq.  (22a)  can  be  written  as 

F(0;  x)n(x)dx  (22b) 

For  incident  light  that  is  plane-polarized  either  along  the  vertical 
or  the  horizontal,  the  two  equations  (22b)  for  this  case  degenerate 
to  a single  equation  for  all  scattering  angles.  On  the  other  hand, 
for  natural  light,  this  is  the  case  only  at  forward  scattering 
angles.  In  this  latter  case,  in  the  Kirchhoff  approximation  to 
Fraunhoffer  diffraction, 

x^J^(x  sin  0) 

F(0;  x)  = (23a) 

k^sin20 


"e  = 


I"(0;  Pq) 
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I III  II 


A considerably  better  approximation,  provided  by  the  works  of 
Penner,  and  Shifrin  and  Punina,  is 

* ^ext 

F (0;  X)  - x)  (23b) 

This  approximation  is  valid  for  0 ~ 10^  and  x - 5.  A detailed 

analysis  of  its  accuracy  (which  is  actually  quite  high)  for  a set 
of  refractive  indices  is  not  available.  With  Eq.  (23b) , Eq.  (22b) 
becomes 

1*2  * 

j = j (xsinO)x^n  (x)dx  (24) 

k^sin^e 

where  n (x)  = (l/4)Q^^^n(x) , and  is  Bessel  function  of  the  first 
kind  and  order  unity. 

E.  Observables  and  Corresponding  Mathematical  Problems  for 
Reconstruction  of  Particle  Size  Distribution 

The  situation  is  siimmarized  in  Fig.  3,  which  is  self- 
explanatory.  It  may  be  noted,  however,  that  under  certain  approxi- 
mations, the  inverse  problem  for  the  individual  spectral  extinctions, 
and  for  the  near-forward  radiances  arising  from  single  scattering, 
can  be  solved  analytically.  For  multiple  scattering,  for  spectral 
extinction  ratios,  and  for  single  scattering  radiance  and  polar- 
ization ratios,  the  problem  is  amenable  to  a minimization  search. 

In  all  other  cases,  the  problem  is  that  of  solving  a first-kind 
Fredholm  integral  equation.  Since  the  latter  equation  is  of  funda- 
mental importance  in  both  the  temperature  and  the  particulate 
sounding  problem,  we  shall  now  turn  to  its  analysis. 

V.  ON  THE  INVERSION  OF  FIRST-KIND  FREDHOLM  INTEGRAL  EQUATIONS 
Consider  the  first-kind  equation: 

f(x)  = A(x,  y)g(y)dy  (x  e X)  (25) 

•Y 
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Figr.  5.  Observations  and  corresponding  mathematical  problems 
for  reconstruction  of  particle  size  distribution  (from  Ref.  2)  . 


There  are  four  important  questions  bearing  on  the  solution  of  such 
an  equation  (Ref.  3) . 


A.  Questions  of  Existence,  Uniqueness,  Stability  and  Construction 

Existence.  What  conditions  must  be  placed  upon  f to  ensure 
that  there  is  a corresponding  g? 

Uniqueness . Given  that  a solution  g^  exists,  are  there  any 
others,  g^,  g^f  etc.  also  satisfying  the  basic  equation? 
Alternatively,  are  there  any  nontrivial  solutions  h(y)  to  the 
equation 

A(x,  y)h(y)dy  = 0 

If  the  answer  is  negative,  then  g(y)  is  unique.  Otherwise,  h e H 
(annihilator  of  A) , and  the  knowledge  of  f (x)  can  tell  nothing 
whatsoever  about  those  parts  of  g(y)  that  belong  to  H;  these  must 
be  deduced  from  information  other  than  that  contained  in  f (x) . 

Stability . Granted  existence  and  uniqueness,  does  g(y)  depend 
continuously  on  f (x) ? Alternatively,  does  the  closeness  of  the 
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computed  f (x)  to  the  measured  f (x)  entail  a similar  closeness  of 
the  inferred  g(y)  to  the  actual  g(y)? 


f(x)  = f(x) 


CLOSE 


f(x) 


g(y) 


If  the  answer  is  in  the  affirmative,  then  g(y)  is  stable;  other- 
wise g(y)  is  unstable.  Physically,  instability  is  the  case  every 
time  the  kernel  tends  to  smooth  the  behavior  of  the  required  func- 
tion for  all  values  of  the  parameter  x.  Under  certain  circumstances, 
the  linearized  temperature  sounding  problem  for  a thick  atmosphere 
can  be  reduced  to  the  convolution  problem  (Ref.  4) 


f(x) 


.00 

a(x  - y)g(y)dy 

— c» 


(26a) 


The  universal  kernel  in  this  equation  is  plotted  in  Fig.  4(a). 

The  Fourier  transform  of  Eq.  (26)  is 

F(k)  = A(k)*G(k)  (26b) 


a (-y)  = exp 


-(-y)  - exp  -(-y) 


/ 


Fig.  4a.  The  universal  weighting  function  (from  Ref.  4). 
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The  filtering  function  A(k)  is  drawn  in  Fig.  4(b).  It  corresponds 
to  a low-pass  filter.  It  is  seen  that  A(k)  drops  very  sharply 
with  increasing  k,  e.g.,  for  k = 7 it  drops  by  four  orders  of  mag- 
nitude. After  a certain  k-value,  the  value  of  A(k)  may  drop  below 
the  experiment  noise.  • In  this  case,  the  inversion  would  correspond 
to  the  noise  rather  than  to  the  signal.  This  illustrates  the  neces- 
sity of  applying  a constraint  in  order  to  limit  the  niomber  of  k 
values  that  can  be  admitted.  This  niomber  is  also  intimately  con- 
nected with  the  information  content  of  the  measurements  under  con- 
sideration. 

Construction.  Granted  existence  and  uniqueness,  can  a pro- 
cedure be  found  which  will  generate  g(y)  from  f (x)  to  any  requested 
finite  accuracy  in  a finite  number  of  steps? 


CUT-OFF  FOR 
HIGH  FREQUENCIES 


Fig.  4b.  Amplitude  of  the  Fourier  spectrum  A(k)  of  the 
weighting  function  (from  Ref.  4)  . 

B.  A Preliminary  Classification  of  Methods  of  Solution 

Since  the  actual  solution  cannot  be  recovered  from  the  measure- 
ments, there  are  three  courses  of  action  that  are  left  open; 

1.  Derivation  of  properties  that  all  solutions  share,  which 
must  then  be  properties  of  the  true  solution. 

2.  Introduction  of  assumptions  (physical,  mathematical) 


449 


about  the  solution  to  restrict  the  class  of  admissible  solutions. 

3.  Investigation  of  alternative  concepts  or  conditions. 

This  is  the  basis  of  the  classification  provided  in  Fig.  5 
for  retrieval  of  the  temperature  profile,  and  in  Fig.  6 for  the 
reconstruction  of  the  particle  size  distribution.  A review  of  the 
various  methods  there  listed  has  been  provided  elsewhere  (Ref.  5)  . 

A number  of  spectral  expansion  and  least-squares  techniques  (e.g. , 
truncated  singular  value  decomposition,  stepwise  regression, 

Cholesky  decomposition,  and  modified  Gram-Schmidt  orthogonalization) , 
the  reduction  to  a second -kind  Fredholm  integral  equation,  invar- 
iant imbedding,  etc.,  remain  to  be  explored,  or  have  not  been  suf- 
ficiently explored.  Likewise,  alternative  or  complementary  con- 
cepts, such  as  the  scattering  processes  discussed  in  the  Introduction 
section,  need  to  be  further  investigated.  In  the  particulate 
sounding  problem,  the  climatological  methods  are  not  applicable, 
and  the  analytical  methods  are  integral  transforms. 

These  tabulations  of  methods  will  be  updated  as  new  con- 
cepts and  methods  are  developed  for  the  solution  of  the  two  prob- 
lems of  temperature  profile  and  particle  size  distribution  recon- 
structions . 

VI.  A PRELIMINARY  CLASSIFICATION  OF  NONLINEAR  SOLUTION  METHODS 

The  methods  of  solution  (Fig.  7)  are  here  necessarily  itera- 
tive. For  the  temperature  sounding  problem,  Newtonian  iteration, 
relaxation,  and  Marquardt  algorithm  have  been  used.  A number  of 
other  nonlinear  least- squares  and  search  methods  are  available  for 
study.  The  same  conclusion  applies  to  the  size  distribution  prob- 
lem where  only  the  minimization  search  technique  and  some  of  its 
variants  have  been  used. 
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Fig.  5.  Classification  for  retrieval  of  temperature  profile 
(From  Ref,  5 ) . 

VII.  A STRATEGY  FOR  THE  PARTICULATE  SOUNDING  PROBLEM 

This  is  a two-step  approach  in  which  the  complex  refractive 
index  is  first  determined,  independently  of  the  particle  size 
distribution,  from  the  inversion  of  spectral  transmission  ratios 
in  a narrow  wavelength  range.  This  parameter  known,  the  particle 
size  distribution  is  then  reconstructed  from  an  analytical  inver- 
sion of  single  scattering  radiances  in  a narrow  forward  cone. 
Whenever  the  conditions  attending  single  scattering  are  no  longer 
valid,  the  single  scattering  solution  can  be  iterated  using  a 
number  of  schemes  to  provide  the  multiple  scattering  solution. 

The  approach  is  well  adapted  to  a limb  scanning  experiment  in  order 
to  also  retrieve  the  vertical  profiles  of  the  unknown  parameters - 
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methodology  for  reconstruction  of 

PARTIOf  SIZE  DISTRIBUTION 


Fig.  6.  Classification  for  reconstruction  of  particle  size 


distribution  (From  Ref.  5) . 


Fig.  7.  Methodology  for  reconstruction  of  temperature 
profile. 
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A.  Retrieval  of  the  Complex  Refractive  Index:  Spectral  Extinction 
Ratio  Approach 

Return  to  Eq.  (16c)  providing  the  expression  of  the  spectral 

extinction,  again  under  the  assumption  that  the  gaseous  contri- 

q 

bution  (t|)  is  known.  Figure  8 illustrates  the  variations  of  the 
extinction  efficiency  factor  as  a function  of  particle  radius  for 
several  wavelengths.  At  any  wavelength,  the  main  contribution  to 
the  measurement  E(A)  (see  Eq.  (16c)  comes  from  the  first  strongly 
marked  peak  of  the  corresponding  curve.  If  the  wavelength  inter- 
val considered  is  sufficiently  narrow,  then  is  essentially 

independent  of  particle  size.  For  example,  for  wavelengths 
extending  from  0.4  to  0.9  ym,  the  contribution  to  solar  extinction 
measurements  originates  from  particles  in  the  size  range  0.3  to 
0.6  ym.  This  latter  range  would  be  further  reduced  if  the  wave- 
length interval  were  smaller.  This  observation  was  used  for 
developing  a method  of  retrieving  the  complex  refractive  index  of 
aerosols  in  a manner  that  is  essentially  independent  of  size  dis- 
tribution. The  same  proced\ire  can  be  applied  to  a set  of  wave- 
length intervals  in  order  to  obtain  the  spectrum  of  this  parameter. 
A complete  description  of  the  method  has  been  provided  elsewhere 
(Ref.  6) . Briefly,  we  consider  the  set  of  spectral  ratios: 


E(A  ) 

R = - - ■ ”-r 
uv  E(A  ) 

V 


(27) 


where  A^  and  A^  are  any  two  wavelengths  of  the  ensemble  of  N wave- 
lengths recorded,  A^  being  fixed  for  any  particular  set.  Obviously, 
there  are  N such  sets.  For  simplicity,  we  shall  model  the  size 
distribution  by  the  gamma  function 

n(x)  = Constant  x^  e (28) 

where  the  two  distribution  parameters  a and  8 are  expressed  in 
terms  of  moments  of  n(x)  . Such  a model,  however,  is  neither 
required  nor  necessary  for  one  can  generate  the  actual  distribution 
from  some  initial  guess,  n^^^  (x) , by  using  the  relation 
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Fig.  5.  Extinction  efficiency  factor  as  a function  of 
particle  radius  for  several  wavelengths  (from  Ref,  7;. 


n(x^)  = (x^)  = 0,  1,  2,  ...  ,) 

where  the  parameters  6^  are  unknowns  of  the  problem.  The  inverse 

problem  then  consists  in  determining  m^,  m^,  and  a and  3 (or  the 

6.*s)  from  the  measured  values  of  R at  a set  of  values  of  the 
I uv 

independent  variable  X^.  We  first  reformulate  the  inversion  prob- 
lem as  a problem  in  minimization  theory.  Denote  the  measured 

ratios  by  R E R {m  , m.;  a,  3 (or  6.)}  and  the  ratios  computed 
uv  r 1 

from  Eqs.  (28)  and  (16c)  by  R E R {m  , m. ; a,  3 (or  6.)}.  Then, 

uv  uv  r 1 1 

define  the  absolute  (or  relative)  deviations  D.  = R.  - R.  T or 

jn  jn  jn  ^ 

(R.  - R.  )/R.  ],  n = 1,  2,  N - 1,  and  the  deviation  vector 

D ^ j u j n 

Dt  = (D.  , D.  , ...  D.„  , ) - where  the  superscript  T denotes 

^3  31  32.  :N  - 
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III 


matrix  transposition.  In  these  definitions/  j=l/  2,  ...N 
refers  to  the  fixed  wavelength  of  the  particular  set  of  ratios 
studied. 


The  problem  is  to  minimize  the  objective  function: 

S.  {m  / m. ; a,  g (or  S )}  = dT  D.  (29) 

D r 1 Z 

representing  the  sum  of  squares  of  deviations.  It  may  be  noted 
that  while  in  Eq.  (29)  we  use  the  Euclidean  norm,  any  other  norm 
could  be  considered.  One  could  also  define  the  objective  function 
by 


S-.  {m  , m.;  a,  g (or  6.)}  = D W D. 
j r 1 Z 'vj  A.j 


(30) 


where  W is  a weighting  matrix  reflecting  the  degree  of  confidence 
in  the  individual  measurements/  their  spectral  resolution/ 
accuracy,  etc.  It  may  be  noted  that  the  function  S.  is  the  equa- 
tion of  a hypersurface  in  the  parameter  space  of  dimensions: 

m / m, , a and  gorm,m./  6.,  6^,  ...  6..  Equivalently,  then, 
r 1 r 1 -L  ^ 36 

the  inverse  problem  consists  in  finding  the  minimum  of  the  surface 

S..  This  minimum  is  located  by  the  author's  Minimization  Search 
D 

Method  (MSM)  (Ref.  6). 


Figure  9 provides  the  contour  curves  of  the  surface 
for  Aj  = 0.4  ym  and  size  distribution  parameters  g = 9,  15  and  23. 

The  true  values  of  the  unknown  parameters  were  taken  to  be  m^  = 1.44, 
m^  = 0.03,  a = 2 and  g = 15  - The  data  ("measurements”)  were  gen- 
erated in  the  computer  for  the  nine  wavelengths:  A^  = 0.4,  0.45, 

0.49,  0.525,  0.575,  0.61,  0.64,  0.67  and  0.7  ym.  The  figure  shows, 
irrespective  of  the  g-value,  that  the  unique  minimum  of  the  sur- 
face S is  always  located  at  = 1.44,  m^  = 0.03,  i.e.,  at  the  true 
value  of  the  complex  refractive  index.  This  result  is  not  sur- 
prising in  light  of  our  earlier  remarks  in  connection  with  Fig.  8. 

It  serves  to  demonstrate  that  the  extinction  ratios  are  insensitive 
to  the  size  distribution.  As  a further  illustration  of  this 


455 


insensitivity.  Table  3 is  provided.  This  table  shows  that  even 
for  highly  accurate  measurements  within  0.1%  accuracy,  the  insen- 
sitivity to  B ranges  from  S = 8 to  B = 23  approximately.  Figure  9, 
together  with  Table  3,  demonstrate  conclusively  the  insensitivity 
of  spectral  extinction  ratios  within  a narrow  wavelength  interval 
to  the  particle  size  distributuion.  They  are,  however,  extremely 
sensitive  to  the  refractive  index,  an  interesting  circumstance. 

An  error  analysis  of  this  method  is  summarized  in  Table  4 for 
various  wavelength  sets  and  experimental  accuracies.  For  example, 

TABLE  3 

Illustrating  the  Insensitivity  of  a Set 
of  Spectral  Extinction  Ratios  to  the 
Particle  Size  Distribution 
Parameter  (from  Ref.  6) 


Ar; 

0.45 

0.49 

0.525 

0.575 

0.61 

0.64 

0.67 

0.70 

X--X--X- 

- X--X-- X- - 

X 

1 

1 

X 

1 

1 

X 

8 

0.0264 

0.0298 

0.0445 

] 0.0832 

0.1264 

1 0.1696 

0.1908 

0.1989 

X-  - X-  -X-  -X 

9 

0.0097 

0.0110 

• 0.0164 

1 

0.0306  1 

0.0466 

0.0625 

0.0702 

0.0735 

10 

0.0035 

0.0040 

; 0.0060 

0.0112  : 

0.0171 

0.0229 

0.0257 

0.0273 

11 

0.0013  1 

0.0015 

0.0022 

0.0042  ; 

0.0062 

0.0084 

0.0089 

0.0094 

12 

0.0004 

0.0006 

0.0008 

1 0.0015 

0.0023 

0.0030 

0.0030 

0.0038 

13 

0.0001 

0.0002 

0.0002 

0.0006 

0.0008 

0.0010 

0.00 10 

0.0009 

14 

0.0000 

0.0001 

0.0001 

0.0002 

0.0002 

0.0003 

0.0000 

0.0000 

15 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

16 

0.0001 

0.0001 

0.0001 

0.0001 

0.0001 

0.0002 

0.0000 

0.0000 

17 

0.0002 

0.0002 

0.0004 

0.0003 

0.0003 

0.0004 

0.0010 

0.0000 

18 

0.0004 

0.0007 

0.0008 

0.0009 

0.0010 

0.0010 

0.0010 

0.0009 

19 

0.0012 

J 0.0019 

0.0023 

0.0024 

0.0025 

0.0025 

0.0030 

0.0028 

20 

0.0033 

0.0052 

1 0.0061 

0.0067 

0.0066 

0.0066 

0.0069 

0.0066 

21 

0.0088 

0.0141 

; 0.0167 

. 1 

0.0182 

0.0177 

0.0177 

0.0188 

0.0179 

22 

0.0238 

0.0384 

0.0453 

0.0493 

0.0482 

0.0478 

0.0494 

0.0490 

X 

1 

1 

X 

1 

1 

X 

1 

X 

1 

1 

X 

1 

1 

X 

1 

1 

X 

1 

X 

1 

1 

X 

1 

I 

X 

X 

1 

1 

X 

1 

1 

X 

1 

1 

X 

1 

X 

1 

1 

X 

1 

1 

X 

X 

1 

I 

X 

1 

X 

1 

1 

X 

X 

1 

X 

1 

1 

X 

1 

1 

X 

1 

--X--X-- 

23 

0.0645 

0.1041 

0.1229 

0.1336 

•0.1305 

0.1304 

0.1325 

0.1320 

24 

0.1742 

0.2817 

0.3325 

0.3617 

0.3532 

0.3503 

0.3588 

0.3563 

25 

0.4675 

0.7570 

0.8943 

0.9733 

0.9502 

0.9432 

0.9658 

0.9595 

26 

3.1770 

2.9293 

1.8157 

0.3927 

0.0210 

0.7765 

0.9262 

0.6098 

27 

1.7884 

0.0926 

0.8605 

1.7033 

0.1176 

0.5073 

2.7075 

3.5535 

28 

3.3692 

1.5580 

4.3406 

6.0713 

2.9787 

4.6707 

2.3339 

2.4582 

29 

6.9479 

5.2720 

4.1998 

2.9636 

5.7156 

4.0362 

10.8785 

10.9422 

30 

16.3302 

4.0105 

11.6555 

10.8407 

6.6112 

7.8650 

1.3394 

2.5600 

31 

16.7168 

4.6318 

11.0039 

10.1242 

5.8426 

7.0861 

2.2034 

1.7098 

Eiror  upper  bounds  Out  digit  rounded  off): 

= 0.001%; 0.005%;  = 0.01%;  X--X--  = 0.05%; = 0.1%. 
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TABLE  4 


Illustrating  the  Sensitivity  of  the  Inverse 
Solution  for  the  Complex  Refractive  Index 
and  Size  Distribution  Parameter  to  the 
Number  df  Extinction  Ratios  and  Their 
Accuracy  (from  Ref.  6) 


Wavelength 

Number  of 

Method 

/«r 

lAm^l 

-/Wj 

lAm|l 

0 

ratios 

significant 

(%) 

{%) 

figures 

/?00-  ^01. 

true  values 

1.44 

0.03 

15.0 

^02.  ^03- 

guess 

1.38 

0.025 

13.5 

/?04.  ^05- 

^06>  ^07- 

4 

MSM 

1.43998 

0.001 

0.02999 

0.033 

13.1909 

^08 

3 

MSM 

1.44015 

0.010 

0.03007 

0.233 

13.1874 

2 

MSM 

1.43840 

0.111 

0.02776 

7.467 

13.3377 

^00.  ^01. 

3 

MSM 

1.44009 

0.006 

0.03009 

0.285 

13.2460 

/?03.  ^04- 

2 

MSM 

I.437I6 

0.197 

0.02686 

10.483 

13.3280 

^06.  ^08 

fill.  Ris. 

3 

MSM 

1.44093 

0.065 

0.03009 

0-303 

13.3913 

Rn 

2 

MSM 

1.44570 

0.396 

0.02656 

11.455 

13.5131 

using  only  three  channels  at  0.45  vim,  0.61  \xm,  and  0.67  pm,  data 
with  0.1%  accuracy  can  be  used  to  obtain  the  real  and  imaginary 
parts  of  the  refractive  index  to  within  0.07%  and  0.3%,  respec- 
tively. 

B.  Reconstruction  of  the  Particle  Size  Distributuion : Angular 

Forward  Scattering  Approach 

This  approach  rests  on  an  analytical  inversion^  of 
Eq.  (.24)  using  the  Bateman-Titchmarsh  formula.  The  result  is 


^K.  S.  Shifrin,  Calculation  of  a certain  class  of  definite 
integrals  containing  the  square  of  a first-order  Bessel  function, 
Trudy  Vsesoyuznogo  Zaochnogo  Lesotekhnicheskogo  Instituta , 2 (1956) . 
(Periodical  title  unofficially  translated  as  "Proc.  All-Union 
Institute  of  Correspondence  on  Forestry  (or  "Wood  Technology").) 
(Note:  Copy  of  this  article  could  not  be  obtained  from  the  Library 

of  Congress,  the  periodical  could  not  be  verified  in  serial  lists^ 
or  the  article  in  abstract  journals.) 
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3 f d(sin^0lQ) 

n*U)  - - J,(y)Y,(y)y  ^jj5^d(sinG)  (3 

X A0  - 10° 

where  y = x sin  0,  and  is  Bessel  function  of  the  second  kind 
and  order  unity.  This  result  is  extremely  interesting  for  it  shows 
that  n* (x)  can  be  obtained  from  the  data  by  a simple  integration. 
And,  if  the  refractive  index  is  known,  such  as  by  the  approach 
described  previously,  the  size  distribution  n(x)  can  also  be 
obtained.  In  so  doing,  no  a priori  knowledge  of  the  size 
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PARTICLE  SIZE,  r^^rr. 

Fig,  10.  Sample  reconstructions  of  the  particle  size  distri- 
bution with  the  angular  forward  scattering  method  (from  Ref.  3). 

distribution,  or  its  analytical  model,  is  required.  Although  not 
critical,  the  knowledge  of  the  refractive  index  allows  a more 
accurate  reconstruction  at  the  smaller  size  parameter  values.  The 
accuracy  obtained  is  estimated  to  be  better  than  a few  percent. 


The  present  method  of  determining  the  size  distribution 
appears  preferable  to  that  utilizing  spectral  extinction  data 
(Ref.  9)  in  that  (i)  it  does  not  need  an  extended  spectral  inter- 
val (0.35  ym  - 2.27  ym)  over  which  the  aerosol  refractive  index 
must  be  assumed  known  and  constant,  (ii)  the  latter  assumption  is 
more  readily  met  in  the  nearrow  spectral  interval  used  in  our  ratio 
technique  for  retrieving  the  complex  refractive  index;  (iii)  it  is 
not  restricted  to  an  upper  size  limit  of  5 ym  with  systematic  over- 
estimations  at  larger  values. 

Figure  10  provides  some  sample  reconstructions  for  various 

angular  resolutions,  A0,  in  a forward  scattering  cone  of  half 

width,  0 = 450  min.  It  is  seen  that  A0  - 15  min.  provides 

max 

indeed  excellent  reconstructions. 


SYMBOLS 


a 

a , b 
n n 

A 

A(k) 

b 

B 

D 

E 

f(x) 

g 

F 

F(k) 

g 

g(y)  or  h(y) 
G(k) 

I 


turbidity  factor 

Mie  coefficients 

kernel  function 

Fourier  transform  in  Eg.  (26b) 

weighting  factor  in  Eq.  (7) 

Planck's  function 

known  constant  in  Eq.  (18b) . Also  deviation  vector 
in  Eqs.  (29)  and  (30) 
extinction  function  defined  in  Eq.  (16a) 
known  function  (measurements) 
auxiliary  functions  in  Eqs.  (20)  and  (21) 
scattering  diagram 
Fourier  transform  in  Eq.  (26b) 
gravitational  acceleration 
sought  function 
Fourier  transform  in  Eq.  (26b) 
specific  intensity  (I^  = incident  value) 
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K(x) 
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n(x) 
n*  (x) 

N(x) 

P 
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Q 
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r 

R 

uv 

S 

S(0) 


t 

T 

w 

W 

X e X 
y e Y 

Y 


y 

Y. 


source  function 
Bessel  function 
wavenximber 

monochromatic  volume  (mass)  absorption  coefficient 

volume v(mass)  extinction  coefficient 

kernel  function  in  Eq.  (19) 

length  of  scattering  medixim 

complex  refractive  index 

particle  size  distribution  [initial  guess  n (x) ] 

a^/4n(K) 

n(x)  or  x^n(x)  in  Eq.  (19) 
pressure 

phase-matrix  = phase-function)  . Superscripts: 

I = isotropic,  R = Rayleigh,  RC  = Rayleigh-Cabannes, 
g = gases,  p = particulates 
Mie  efficiency  factor  for  extinction 
particle  radius 
radius  vector 
spectral  extinction  ratio 
ob j ective  function 

Mie  complex  scattering  amplitude  in  forward 
direction 

transmission  function 
temperature 

mass  mixing  ratio  of  absorbing  gas 
weighting  function 
size  parameter 

single-valued  function  of  pressure 
value  of  y at  the  surface 

at  top  of  atmosphere  or  at  satellite  altitude 
X sin  0 in  Eq.  (31) 

Bessel  function 
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altitude  (z^  = value  of  z at  lower  boundary;  = 
at  upper  boundary;  z = at  top  of  atmosphere  or  at 
satellite  altitude) 

expression  in  moments  of  size  distribution  in  Eq-  (28) 
direction  cosines 
gaseous  depolarization  factor 
displacement  constants 

zenith  angle  (0  = cos  ^ ]i)  . Subscript  zero  for 
incidence . 

scattering  angle  (A0  = angular  resolution) 
wavelength 

frequency  (v  = f in  continuijm,  v = £ in  spectral 
line) 

absorbing  gas  density,  also  p = 2xlm  - l|  in  Eq. 

(18a) 

volume  (mass)  scattering  coefficient 
optical  depth  (t^  = optical  thickness) 
scattering  optical  thickness 
azimuth  angle 
solid  angle 

single  scattering  albedo  (in  Table  2) 

direction  vector.  Superscript  plus  (minus)  for 
upward  ( downward ) 
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DISCUSSIONS 


Green:  Did  I understand  your  curves  were  going  to  negative  numbers 

of  particles  below  three  microns? 

Fymat:  In  some  of  these  retrievals,  when  the  size  distribution 

is  zero  at  the  origin,  we  have  such  negative  tails-  We  have  numer- 
ical difficulties  in  retrieving  these  null  values.  The  difficulties 
disappear,  however,  when  the  distribution  has  a finite  value  at  the 
origin  or  were  we  to  vary  the  half-width  of  the  forward  scattering 
cone  according  to  the  particular  radius  for  which  we  wish  to  recon- 
struct the  distribution.  The  result  displayed  in  Figure  10  of  my 
paper  included  neither  of  these  two  features,  and  was  aimed  at 
illustrating  results  when  both  the  distribution  starts  from  zero 
and  the  scattering  cone  are  fixed  for  all  radius  values.  In  this 
context,  providing  we  remain  within  the  domain  of  applicability  of 
the  theory,  the  particular  radius  value  at  the  origin  is  irrelevant. 
It  has  really  nothing  to  do  with  exactly  three  microns.  It  is  just 
that  we  are  using  the  wavelength  of  one  micron  and  the  particular 
distribution  we  are  employing  for  doing  these  retrievals  started  at 
three  microns.  1 micron  = 1 micrometer. 

Reagan:  I have  two  questions  or  comments.  One  on  the  size  dis- 

tribution limit.  We  have  talked  about  this  before.  It  appears 
that  the  combination  of  the  kernel  and  the  specific  shape  of  the 
distribution,  particularly  when  you  get  into  some  of  the  bimodal 
types,  indeed  does  extend  the  range  over  which  you  may  hope  to 
invert  the  size  for  us.  So  we  deal  with,  for  example,  0,4  to  1.0 
micron  wavelength  range  of  measurements  for  radiometer  wrk.  And 
in  the  inversions,  indeed,  for  certain  types  of  size  distributions 
we  seem  to  have  some  success  getting  out  to  three  to  five  microns. 
The  second  thing  is  with  regard -to  refractive  indexes  you  were 
showing  there.  I was  curious  what  the  size  range  assumption  might 
have  been  on  the  Junge  calculations.  For  what  I gave  earlier  in 
the  conference^  for  our  radiometer  work,  dealing  again  from  say  0.4 
to  1.0  micron,  the  refractive  index  seemed  to  be  quite  insensitive 
in  effecting  the  shape  of  the  size  distribution  for  values  running 
from  about  1.33  up  bo  about  1.55.  Those  calculations  were  based 
on  Junge  integrals  from  about  0.02  to  10  microns,  again  for  that 
0.4  to  1.0  micron  range.  Perhaps  it  was  the  wavelength  range  that 
was  giving  the  variation  in  index  that  you  were  showing  there.. 

Fymat:  Indeed,  we  have  discussed  this  problem  several  times.  If 

I go  back  only  to  the  extinction  efficiency  factor,  I have  great 
difficulty  in  trying  to  understand  how  you  could  retrieve  size 
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distribution  up  to  five  microns  or  more  with  the  wavelengths  you 
use.  The  work  of  Yamama  to --Tanaka  showed  that  you  have  to  go  to 
larger  wavelengths  to  be  able  to  sound  the  larger  particle  radii, 

NOW/  if  you  combined  the  extinction  with  other  techniques  to  com- 
pensate in  a sense  for  the  other  wavelengths  you  are  not  using, 
maybe I But,  again,  looking  strictly  at  the  extinction,  I have 
difficulty  understanding  on  the  basis  of  the  physics  of  the  problem 
how  you  can  go  to  five  micron  particles,  approximately,  by  restric- 
ting yourself  to  this  very  narrow  wavelength  range, 

Reagan:  Well,  as  I was  saying,  perhaps  certain  size  distributions 

with  peak  particle  concentrations  near  5.0  microns  may  extend  that 
slightly. 

Fymat:  Slightly  perhaps  up  to  one  to  two  microns,  but  not  to  five 

microns. 

Twomey:  I think  I am  a little  uncomfortable  when  I hear  phrases 

like  "size  distribution"  and  "refractive  index."  I think  people 
should  think  a little  more  of  perhaps  changing  the  wording  a little 
bit  and  saying  "size  and  refractive  index  distribution."  Because, 

I think  we  have  no  guarantee  whatever  that  the  composition  remains 
constant  and  so  I’ll  say  over  a decade  in  size  distribution.  In 
fact,  when  I get  down  towards  0.1  on  the  micron,  we  are  in  a very 
dangerous  region  where  there  are  really  no  good  sources  of  particles 
in  the  atmosphere.  We  are  at  the  lower  tail  of  effectiveness  of 
processes  which  produce  particles  from'  the  bulk,  and  we  are  at  a 
rather  large  size  when  we  are  at  the  upper  tail  of  the  processes 
which  produce  particles  from  the  vapor.  So  we  must  expect,  I 
think,  especially  around  that  size  region,  mixed  composition  and, 
furtheinnore,  a mixed  composition  which  is  size  dependent. 

Fymat:  I fully  agree  with  your  statement. 

Herman:  i think  you  made  a statement  during  your  talk  which  I am 

not  sure  you  meant  to  say,  but  you  said  there  is  no  size  information 
in  the  range  of  0.4  to  1.0  micron, 

Fymat:  No,  that  was  not  the  statement, 

Herman:  i thought  that  was  what  you  said, 

Fymat:  what  I said  was  that,  if  you  look  into  narrow  wavelength 

intervals,  the  corresponding  spectral  ratio  measurements  are  a little 
sensitive  to  the  size  solution.  I am  not  saying  there  is  no  infor- 
mation. As  you  broaden  the  interval,  the  size  sensitivity  increases, 

Herman:  I would  agree  with  you  that  there  is  no  information  out 

beyond  one  or  two  microns  on  the  size  distribution,  but  there  is 
information  from  a few  tenths  of  a micron  out  to  one  or  two  microns. 
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And  I think  theoretically  it  is  easy  to  show  this.  The  fact  that 
the  kernels  do  not  peak  at  those  points  does  not  mean  that  there 
is  no  information. 

Fymat:*  No,  I never  said  that  there  is  no  information.  Also,  keep 
in  mind. that  I am  working  with  extinction  ratios  and  not  the  indi- 
vidual extinctions . 

2 

Kuriyan:  In  our  1974  paper,  we  have  shown  that  if  a Deirmendjian 

haze  H-type  distribution,  with  a variable  parameter  b,  was  assumed, 
then  precise  multispectral  extinction  measurements  can  be  used  to 
infer  the  size  distribution  parameter  b as  well  as  the  complex  index 
of  refraction.  The  errors  in  the  measurement  introduce  corresponding 
errors  in  the  inferred  parameters-  If  a Junge  distribution  is  used, 
then  the  problems  associated  with  the  limits  of  integration  prevent 
the  determination  of  the  index  of  refraction  in  some  cases.  It  seems 
as.  if,  and  no  attempt  to  prove  this  statement  has  been  made,  the 
log-normal  distribution  can  also  be  used  to  infer  the  size  distri- 
bution parameter  and  the  index  of  refraction.  Thus,  extinction 
measurements  in  the  visible  range  cannot  be  used  to  arrive  at  a 
unique  result  but  can  yield,  if  the  haze  H type  of  size  distribution 
is  assumed,  both  the  modal  radius  and  the  index  of  refraction.  It 
must  be  emphasized  that  both  the  log-normal  and  the  haze  H-type 
distributions  are  well  behaved  at  small  and  large  values  of  the 
radius  and,  thus,  the  integrals  can  be  computed  for  the  infinite 
range.  It  seems  reasonable  to  conclude  that  the  origin  of  the 
nonuniqueness  is  the  incomplete  info2nnation  that  is  available  when 
we  restrict  the  measurements  to  the  visible  range.  It  is,  of 
course,  impractical  to  extend  measurements  to  the  entire  spectrimi 
since  absorption  due  to  nonaerosol  matter  will  interfere  with  the 
measurements  - 

Fymat:  I have  also  employed  the  Deirmendjian  distribution.  What 

I have  shown  is  that,  for  such  a distribution,  the  extinction 
ratios  taken  over  a narrow  wavelength  interval,  say  0.45  to  0.70 
microns,  cannot  be  used  to  simultaneously  reconstruct” the  particle 
size  distribution  out  to  several  microns  and  retrieve  the  complex 
refractive  index.  3 Nevertheless,  what  is  interesting  is  that  these 
extinction  ratios  enable  us  to  retrieve  the  complex  refractive  index 
essentially  independently  of  the  size  distribution,  at  least  when 
a gamma  distribution  is  assumed. 


2 
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van  de  Hulst:  In  one  of  your  earlier  slides,  you  had  a number  of 

linear  combinations,  and  one  of  the  linear  combinations  was  between 
the  situation  of  cloud  cover  and  without  cloud  cover,  I remember. 


Fymat:  Yes. 

van  de  Hulst:  Have  you  looked  at  all  into  the  validity  of  such  a 

.linear  combination,  because  it  is  a very  general  question.  If  you 
have  a sky  partly  covered  by  clouds,  of  course,  the  interplay  of 
the  light  is  completely  different  from  what  you  would  have  in  a 
stratified  layer. 

Fymat:  What  I had  in  mind  in  putting  that  linear  combination  was 

really  a masking  effect  of  the  clouds  in  the  same  way  as  we  heard 
earlier  for  the  temperature  sounding  problem,  and  not  so  much  in 
the  interaction  which  takes  place  between  the  clouds  and  the 
radiation  field.  But  I will  have  to  look  more  carefully  into  this 
latter  aspect.  The  other  thing  I would  like  to  point  out  is  in  a 
different  subject.  I think  that  this  kind  of  extinction  experiment 
can  extremely  easily  be  accommodated  by  a SAGE-type  experiment. 

It  is  really  a matter  of  choice  of  wavelengths. 
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APPLICATION  OF  MODIFIED  TWOMEY  TECHNIQUES  TO 
INVERT  LIDAR  ANGULAR  SCATTER  AND  SOLAR 
EXTINCTION  DATA  FOR  DETERMINING 
AEROSOL  SIZE  DISTRIBUTIONS 

B.  M.  Herman 
University  of  Arizona 


It  has  been  shown  theoretically  (Ref.  1)  that  the 
polarization  properties  of  the  angularly  scattered  laser 
light  from  a volume  of  air  may  be  used  to  determine  the 
size  distribution  of  the  aerosol  particles  within  the  volume 
by  the  use  of  appropriate  inversion  techniques.  Similar 
techniques  may  be  employed  to  determine  a "mean"  size  dis- 
tribution of  the  particulates  within  a vertical  column 
through  the  atmosphere  from  determinations  of  the  aerosol 
optical  depth  as  a function  of  wavelength.  In  both  of  the 
examples,  primarily  because  of  the  nature  of  the  kernel  func- 
tions involved,  a modification  of  an  inversion  technique 
originally  described  by  Twomey  (Refs.  2 and  3)  has  been 
employed.  Details  of  this  method  will  be  presented  as 
well  as  results  from  actual,  mhasurements  employing  the 
University  of  Arizona  bistatlc  lidar  and  solar  radiometer. 


I . INTRODUCTION 

With  the  development  in  recent  years  of  high-powered,  pulsed 
laser-radar  (lidar)  systems,  considerable  effort  has  been  directed 
toward  the  problem  of  utilizing  these  systems  to  infer  the  vertical 
structure  of  the  atmosphere  (e.g.  , Refs.  4,  5,  and  6)  . These 
studies  were  based  upon  an  analysis  of  the  backscattered  signals 
only  and,  as  such,  were  quite  limited  in  information  content.  As 
a result,  little  progress  has  been  made  from  such  studies  toward 
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inferring  aerosol  size  distributions  (Ref • 7) - However,  as  has 
been  pointed  out  by  Reagan  and  others  (Refs.  8 and  9),  considerable 
additional  information  may  be  obtained  from  the  angular  scattering 
properties  of  the  atmosphere  by  the  use  of  bistatic  lidar. 
Accordingly,  such  a system  has  been  constructed  at  the  University 
of  Arizona  for  the  purpose  of  studying  the  size  as  well  as  the 
height  distribution  of  atmospheric  aerosols.  The  essential  features 
of  this  system  have  been  described  elsewhere  (Refs.  8 and  9). 

In  another  parallel  line  of  research,  a series  of  solar 
radiometers  have  been  designed  and  constructed  to  measure  atmo- 
spheric turbidity  at  a series  of  wavelengths  throughout  the  visible 
and  near  Infrared  (IR)  portions  of  the  spectrum.  These  instruments 
have  been  described  by  Shaw  and  others  (Refs-  10  and  11)  . The  pur- 
poses of  the  solar  radiometry  have  been  many.  They  provide  a con- 
tinuous monitoring  of  atmospheric  aerosol  optical  depths  through- 
out the  wavelength  range  covered  by  the  filters.  Secondly,  since 
one  of  the  filters  is  at  the  laser  (ruby)  wavelength  of  6943  nm, 
the  optical  depth  of  the  aerosols  at  this  wavelength  provides  a 
means  to  normalize  the  laser  back-scatter  signal,  thereby  pro- 
viding a bettet  means  to  interpret  the  back-scatter  returns  in 
terms  of  the  volume  back-scatter  .coefficient.  This  procedure  has 
been  described  by  Fernald  and  others  (Ref.  12) . An  additional 
bonus  of  solar  radiometry  is  that,  by  a proper  choice  of  filters, 
it  may  be  used  to  infer  vertical  loadings  of  absorbing  gases,  such 
as  ozone  and  H^O.  Finally,  by  virtue  of  the  variation  of  aerosol 
turbidity,  or  optical  depth  with  wavelength,  a second,  completely 
independent  method  of  determining  the  aerosol  size  distribution  is 
provided.  The  distribution  thus  obtained  is  some  type  of  average 
through  the  entire  vertical  column  of  the  atmosphere  but  is, 
nevertheless,  quite  useful-  In  the  following  section,  the  inversion 
methods  used  will  be  outlined,  and  finally,  results  from  actual 
data  will  be  presented. 
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II . THEORETICAL  DEVELOPMENT 


A.  Bi Static  Lidar 

Figure  1 shows  a schematic  diagram  of  the  bistatic  system - 

In  this  diagram,  the  transmitter  sends  out  a pulse  at  an  angle 

to  the  local  normal,  while  the  receiver  views  at  an  angle  y ^ to 

the  local  normal.  The  shaded  area  at  a height  z above  the  ground 

represents  the  instantaneous  scattering  volume,  V,  which  is  that 

volume  from  which  scattered  light,  scattered  through  the  angle  0, 

reaches  the  receiver  at  a given  instant  of  time.  For  the  case  in 

which  the  transmitted  pulse  is  essentially  all  within  the  receiver 

field  of  view,  the  lidar.  equation  for  the  n Stokes  parameter  of 

(s) 

the  scattered  flux  received  by  the  receiver  F^  (0,  R^)  is 


Fig,  i.  Schematic  diagram  of  bistatic  system. 
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where  t (z)  is  the  optical  depth  from  the  ground  to  height  z,  £ is 

the  pulse  length,  A_  is  the  effective  receiver  aperture,  and  F 

R m 

is  the  mth  Stokes  parameter  of  the  transmitted  flux.  The  term 
P^(0)  is  the  nmth  element  of  the  scattering  matrix  for  scattering 
through  the  angle  0 , with  dimensions  of  cross  section  per  unit 
voliome  per  steradian.  For  a more  complete  discussion  of  the  lidar 
equation,  see  Regan  and  Herman  (Ref.  8) . Since  R^  = z/cos  y^,  ’ 
Eq.  (1)  may  be  written  as 
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where  K = , a constant  for  ‘observations  made  at  a constant 

2z2 

height,  z. 


Let  us  now  consider  the  scattering  matrix  element,  P (0) . 

nm 

As  defined,  this  term  has  the  dimensions  of  scattering  cross- 
section  per  unit  volume  per  steradian  and  is  given  by 
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(3) 


where  P (0,  r)  is  the  nmth  element  of  the  scattering  matrix  for 

a particle  of  radius  r,  and  n(r)  is  the  particle  size  distribution 

function.  The  term  P (0)  is  the  nmth.  element  of  the  normalized 

nm  Ray 

scattering  matrix  for  Rayleigh  scattering,  is  the  Rayleigh  mass 

attenuation  coefficient  at  some  standard  level  for  the  wavelength 
X,  while  p and  p(z)  are  the  air  densities  at  the  standard  level 
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and  at  the  height  2,  respectively.  The  integral  on. the  right-hand 
side  can  be  replaced  with  a sum,  such  that  Eq.  (3)  becomes 
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where  P^(6,  r.)  is  the  nmth  element  computed  at  the  midpoint 
radius  of  the  jth  interval ^ and 


f(r  ) = / n(r)dr  (j  = 1,  2,  ...  q) 

^ r . 


(5) 


Substituting  Eq.  (4)  into  Eq.  (2) , the  lidar  equation  for  the  nth 
Stokes  parameter  at  a scattering  angle,  0^,  becomes 
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There  will,  thus,  be  four  equations  of  the  form  of  Eq.  (6),  one  for 
each  of  the  four  Stokes  parameters  (i.e.,  for  each  value  of 
n = 1,  2,  3,  4),  for  a given  angle,  0^.  Thus,  the  total  number  of 
such  equations  will  be  4 x number  of  scattering  angles  at  which 
observations  are  made.  In  general,  we  may  write 

Af  = g (7) 

where  g is  a vector  of  the  observations,  with  components  given  by 
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and  of  dimensions  4 x where  is  the  number  of  scattering  angles, 
0^,  at  which  observations  are  made.  We  have  here  assumed  that  the 
optical  depth,  t (z) , is  known  so  that  the  Rayleigh  contribution 
term  may  be  computed  and  subtracted  from  the  observations.  The 
value  of  x(z)  may,  in  fact,  be  determined  to  within  ± 1%  accuracy 
from  other  measurements,  the  technique  for  which  will  be  described 
in  a later  paper.  The  vector  f is  the  vector  of  the  unknowns  to 


be  solved  for,  with  components  f(r^),  f(r^). 


f(r  ),  where 

q 


each  component  gives  the  number  of  particles  within  the  particular 
increment  of  radius,  as  given  by  Eq.  (5) . The  coefficient  matrix, 
A,  is  composed  of  known  quantities,  the  elements  of  which  are 
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Thus,  the  A matrix  will  have  dimensions  4Z  x q,  where  4£  is  the 

total  number  of  obseirvations , and  q is  the  number  of  unknowns, 

the  f(rj.  The  individual  elements  of  the  P scattering  matrix 

for  q given  value  of  0 and  r,  P (0. , f .),  are  determined  from  Mie 

nm  1 j 

theory,  while  the  Stokes  parameters  of  the  transmitted  light,  the 

, are  assumed  to  be  [1/2,  1/2,  1,  0]  in  order  that  the  trans- 
mitted light  be  linearly  polarized  at  45°  to  the  scattering  plane. 


In  reality,  since  measurement  errors  are  always  present,  as 
are  quadrature  and  roundoff  errors  in  the  numerical  evaluation  of 
Eq.  (3)  (i.e.,  Eq.  (4)),  Eq.  (7)  should  be  written  as 

Af  = g + e (9) 


where  the  components  of  e are  the  errors  in  each  equation.  Because 
of  the  presence  of  errors,  Eq.  (9)  no  longer  possesses  a unique 
solution,  and  attempts  at  a direct  solution  with  e set  equal  to 
zero  almost  always  result  in  poor,  highly  oscillatory  solutions, 
as  shown  by  Phillips  (Ref.  13).  Twomey  (Refs.  2 _and  3)  d«tvonstrated 
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possible  constraints  which  are  applicable  to  problems  of  this  sort. 

One  of  these  constraints  is  in  the  form  of  a smoothing  constraint 

in  which  the  second  derivative  of  the  solution  points 

9^f(r.)/3r.^  is  minimized.  This  constraint  leads  to  the  solution 
3 3 

f = (a'^A  + YH)”^A^g  (10) 

T ... 

in  which  A is  the  transpose  of  the  A matrix,  H is  a smoothing 

matrix,  and  y is  a Lagrangian  multiplier.  The  value  of  y deter- 
mines the  amount  of  smoothing,  and  is  so  chosen,  by  experimenta- 
tion, to  give  reasonable  solutions.  The  value  of  y is  not  overly 
critical,  in  the  sense  that  varying  it  from  the  chosen  value  over 
a range  of  an  order  of  magnitude  does  not  seriously  affect  the 
solution. 

Another  possible  constraint  proposed  by  Twomey  for  those  cases 
in  which  a reasonable  estimate  of  the  solution  is  known  consists 
of  minimizing,  in  a least  squares  sense,  the  differences  between 
the  actual  solution  and  the  initial  estimate,  or  trial  solution. 

This  leads  to  the  solution 

f = (a'^A  + y1)”^(a\  + YP)  (11) 

^ ^ 

where  I is  the  identity  matrix  ‘and  P is  the  vector  of  the  estimated 
solution  points. 

In  the  present  study  utilizing  angular  scattering  data,  the 

elements  of  the  A matrix  given  by  Eq.  (8)  are  composed  of  elements 

of  the  scattering  matrix,  P (0,  r) . For  values  of  r < A,  where 

nm 

X is  the  wavelength  of  the  incident  light  (in  this  case,  X=0.6943  ym, 

representative  of  a ruby  laser),  P (0,  r)  is,  indeed,  a smooth 

nm 

function  of  r.  This  is  demonstrated  in  Fig.  2,  which  shows  P^^ 
for  0 = 130^,  for  radii  between  O.Olym  and‘ 0.190  ym.^  However,  as 
r increases,  the  matrix  elements  become  an  increasingly  erratic 
function  of  r,  as  demonstrated  in  Fig.  3,  which  shows  the  same 
element  for  radii  between  1.20ym  and  1. 5Q  ym..  Therefore,  in  order 
to  get  an  accurate  numerical  representation  of  Eq.  (3) , it  is 
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and  1.50  ym 


necessary  to  break  the  integral  up  into  a large  number  of  very 
small  intervals  (Ref.  14) . For  the  range  of  sizes  considered 
in  this  work,  approximately  1000  ^intervals  in.  r were  considered 
necessary  in  order  to  assure  a reasonably  accurate  representation 
of  the  integral  in  Eq.  (3).  This,  then,  would  require  1,000  values 

of  the  unknown  f(r.),  as  can  be  seen  from  Eqs.  (4)  and  (5),  which, 

^ T 

in  turn,  would  require  the  inverse  of  a matrix,  A A,  as  in  Eqs. 

(10)  and  (11)  of  dimensions  1000  x 1000.  In  order  to  simplify 

this  process,  the  following  technique  was  employed.  Each  of  the 

relatively  large  intervals  in  Eq.  (4)  was  broken  down  into  a series 

of  smaller  sub- intervals  such  that  Eq.  (4) , less  the  Rayleigh  term, 

becomes 


P (0.)  S f (r, ) y P (6. , r.  )W,  (r,  ) + 

nm  1 aer  1 “ nm  i k,  k-  k, 

k 111 
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q S nm  X 


q q q 


(12) 


where  the  (r ^ ) are  weighting  functions  to  be  described  below, 

and  the  subscript  aer  indicates  that  part  of  the  scattering  matrix 
element  due  to  aerosols  only.  The  coefficients  of  the  unknown 
f(fj)  in  Eq.  (12)  will  then  appear  in  the  expression  for  the  coef- 


ficient matrix,  Eq.  (8),  in  place  of  the  P (0.,  r.) 

nm  1 j 

example,  the  expression  for  A^^  becomes 
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Initially,  the  weighting  functions  are  unknown,  as  knowing 
them  would  be  equivalent  to  knowing  the  sought  after  size  distri- 
bution function.  Thus,  an  "intelligent”  first  guess  is  made,  and 
the  problem  reduces  to  one  of  finding  the  unknown  f (r^)  which, 
when  multiplied  by  the  assumed  weighting  functions,  gives  the 
sought  after  size  distribution  function.  If  the  initially  assumed 
weighting  functions  are  exact,  the  solution  vector  f will  be  a 

unit  vector  (i.e.,  =1/  j =1/  2, q)  . Any  errors  in  the 

initial  guess  of  the  weighting  function  will  be  mathematically 
indistinguishable  from  measurement  errors  and,  therefore,  the 
worse  the  initial  estimate,  the  worse  will  be  the  resulting  solu- 
tion. However,  this  first  solution  will  still  be  a better  approxi- 
mation than  the  first  guess,  and  so  the  process  is  repeated.  For 
the  second  iteration,  however,  the  first  solution  is  used  as  the 
weighting  function,  and  a new  f'(r.)  is  obtained.  For  the  second 
iteration,  the  first  solution  vector  components  are  assumed  to  be 
valid  at  the  mid-point  of  each  of  the  large  intervals  in  r,  and 
are  connected  by  straight  lines.  Thus,  for  the  second  iteration, 
Eq.  (12)  becomes 


P (6.)  = f*(rj  I P (0.,  ) <f(r,)  + 

nm  1 aer  ^ t nm  i k/  1 1 
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where  Ar,  ...  Ar  are  the  increments  of  Ar  for  the  large  inter- 

i q-1 

vals,  while  the  Ari.  are  the  intervals  of  Ar  in  the  subintervals, 

j 

and  the  f*(r^)  are  the  components  of  the  new  solution  vector.  In 
principle,  one  could  continue  the  iterative  process  until  two 
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successive  solution  vectors  agreed  to  within  any  pre-specified 
amount.  In  practice,  it  was  found  that  two  such  iterations  were 
adequate . 


Since  most  measurements  of  the  size  distribution  of  continental 
aerosols  seem  to  indicate  that  these  functions  follow  typically 
what  is  known  as  a Junge  distribution  (Refs.  15  and  16)  , given  by 


dn 

dr 


cr 


(V*  + 1) 


(15) 


a reasonable  first  guess  for  the  weighting  functions  would  be  to 
assume  that  the  distribution  function  is  of  the  form  given  by 
Eq.  (15) . Thus,  W is  given  by 
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where  c is  a normalizing  constant  determined  such  that  the  total 
integral  of  Eq.  (15)  over  all  sizes  yields  the  proper  nxamber  den- 
sity per  unit  volume,  while  v*  is  a shaping  constant.  Typical 
values  of  v*  lie  in  the  range  of  2.0  to  4.0. 


In  the  theoretical  work  to  follow,  "measurements"  of  the  four 
Stokes  parameters  at  five  scattering  angles,  0,  at  fixed  height  z, 
and  for  various  assumed  size  distributions,  were  computed  from 
Eq.  (2) . These  twenty  "observations"  were  then  used  per  ssr  and 
with  introduced  random  errors  of  1%  and  2%,  in  order  to  compute 
20  inversion  solution  points,  the  f(r^),  by  the  tecipiigue  des- 
cribed cibove.  These  solution  points  were  taken  "to  apply  over  each 
of  20  large  intervals  in  Ar,  taken  such  that  log  Ar  = Constant  for 
all  intervals.  The  volume  scattering  matrix  elements  due  to 

aerosols,  P (6^)  , were  computed  from  Mie  theory  with  a numerical 

nm  aer»  ^ ^ 
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approximation  to  Eq.  (3)  , while  the  Rayleigh  component  was  com- 
puted assuming  a standard  atmospheric  density  distribution. 

Figure  4 shows  an  inversion  solution  for  a true  distribution 
which  is  a straight  Junge-type  with  v*  = 2.5.  The  initial  weighting 
functions  were  computed  from  Eq.  (16)  assijming  v*  = 3.5  (long 
dashes)  . The  initial  smoothing  solution,  Eq.  (10)  , is  shown  as 
short  and  long  dashes,  while  the  final  solution,  Eq.  (11)/  is 
shown  as  short  dashes.  For  the  sake  of  clarity,  the  second 
iteration  using  the  smoothing  constraint  has  been  omitted  from 
this  figure  and  from  Fig.  5,  but  in  both  cases  lies  roughly  inter- 
mediate between  the  first  and  final  solutions.  In  this  case,  a 
nearly  perfect  inversion  is  obtained  for  radii  greater  than  about 
0.2  . It  is  important  to  note  here  that  "exact  observations”  were 


Fig.  4.  Inversion  results  for  zero  error  in  observations  for 
Junge-type  distribution. 
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used  in  this  case?  that  is,  observations  precisely  as  ccxnputed 
were  used.  In  the  smaller  range  of  sizes,  results,  while  satis- 
factory, are  not  quite  as  good.  This  is  undoubtedly  due  to  the 
fact  that  the  smaller  sizes  are  in  the  Rayleigh  region  for  the  ruby 
wavelength,  which  results  in  all  such  particles  having  the  same 
variation  of  the  scattering  matrix  elements  with  scattering  angle, 
6.  Thus,  the  only  information  content  in  the  measurements  for 
these  small  particles,  if  they  were  precisely  Rayleigh  scatter ers, 
is  the  total  scattering  cross  section,  and  this  can  be  made  up  of 
large  numbers  of  the  smallest  particles,  or  fewer  numbers  of  the 
larger  particles  of  the  Rayleigh  region,  or  any  intermediate  dis- 
tribution. Since  the  scattering  differs  slightly  from  pure 
Rayleigh,  there  is,  in  fact,  a slight  amount  of  information  con- 
tent in  the  measurements,  as  can  be  seen  from  Fig.  5.  In  this 
figure,  the  true  distribution  of  particles  is  a modified  Junge 


Fig-  5.  Inversion  results  for  zero  error  in  observations 
for  modified  Junge  distribution. 
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distribution,  called  distribution  X,  in  which  a parabola,  given  by 
a dn/dr  = a(r^  - r)  + b was  assumed  for  r £ 0.103  and  a Junge 
distribution  with  v*  = 2.5  was  assumed  for  r ^ 0.103  y.  The  con- 
stants a and  b were  determined  by  setting  dn/dr  and  d/dr (dn/dr) 
equal  for  both  distributions  at  r = 0.103  y.  As  an  initial  guess 
for  the  weighting  functions,  a straight  Junge  distribution  with 
V*  = 2.0  was  employed.  Again,  zero  error  was  placed  in  the  obser- 
vations. As  can  be  seen  from  the  figure,  an  excellent  solution 
resulted,  even  at  the  smaller  sizes  where  the  distribution  departed 
markedly  from  a Junge  type. 

Figures  6,  7,  and  8 present  results  in  which  random  errors 
of  1%'  and  2%  were  put  into  the  observations,  in  addition  to  the 
”zero  error"  results.  For  the  sake  of  clarity,  only  the  final 
solutions  are  shown  in  these  figures.  Figure  6 is  for  a straight 


Fig,  6.  Inversion  solutions  for  a Junge  distribution  with 
V*  = 2.5f  with  zeror  1%  and  2%  errors  in  the  observations. 
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Junge  distribution  with  v*  = 2.5,  As  can  be  seen,  the  presence 
of  errors  up  to  2%  degrades  the  solution  only  slightly,  and,  in 
general,  the  results  show  excellent  agreement  between  the  true  and 
computed  distributions.  Figure  7 shows  results  for  distribution 
X except  for  these  calculations,  a much  worse  initial  guess  of 
V*  = 4.5  was  used  as  compared  with  the  results  shown  in  Fig.  5. 

By  comparison  of  the  final  solution  with  no  error  in  Fig,  7 to  the 
final  solution  of  Fig.  5,  It  can  be  seen  that  this  worse  initial 
guess  definitely  leads  to  a poorer  final  solution,  particularly  at 
the  smallest  sizes.  Even  so,  the  final  solutions  must  be  con* 
sidered  as  quite  satisfactory,  with  no  appreciable  degradation 
with  the  introduction  of  measurement  errors,  except  again  at  the 
smaller  sizes.  Apparently  the  introduction  of  even  small  errors 
into  the  observations  is  enough  to  essentially  eliminate  all  infor- 
mation content  about  the  smallest  sizes,  due  to  the  initially  low 
content  inherent  in  these  sizes  as  previously  discussed. 

Finally,  in  Fig.  8,  results  are  presented  for  another  dis- 
tribution, distribution  Y.  For  this  distribution  X was  used  for 
r 1.3  yra-  This  was  fitted  to  a Junge  distribution  with  v*  = 3.0 
for  1.3ym5  r ^ 3-4  pm,  while  for  r > 3.4  pm  a Junge  distribution 
with  V*  = 1.1  was  assumed.  The  results  again  are  considered  quite 
satisfactory.  For  zero  error  and  1%  error,  it  is  noted  that  the 
solution  is  able  to  follow  the  abrupt  change  in  slope  at  3-4  pm, 
while  the  2%  error,  the  constraint  required  (i.e.,  the  value  of 
Y in  Eqs.  10  and  11)  is  large  enough  so  as  to  force  the  solution 
into  a nearly  straight  line  on  the  log-log  plot  of  the  figure - 
This  is  true  at  the  small-size  end  of  the  distribution  also.  Here, 
the  zero  and  1%  error  solutions  are  able  to  somewhat  follow  the 
curvature  of  the  true  solution,  but  with  2%  error  this  feature  is 
completely  lost. 
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B . Solar  Radiometer 


As  mentioned  earlier,  values  of  the  aerosol  optical  depth 
as  a function  of  wavelength  also  contain  considerable  information 
concerning  the  aerosol  size  distribution  and,  therefore,  inversion 
techniques  may  be  employed  in  order  to  recover  this  information. 
The  integral  equation  which  relates  aerosol  optical  depth  to  an 
aerosol  size  distribution  can  be  written  as 


T 

aer 


(X) 


.00 

■D 


7Tr2Q^(r, 

(D 


X,  m)n(r,  z)dzdr 


(17) 


where  n(r,  z)dr  is  now  the  height  dependent  aerosol  size  distri- 
bution, m the  complex  refractive  index  of  the  aerosols,  and 
Q^(r,  X,  m)  the  extinction  efficiency  factor  from  Mie  theory. 

Upon  performing  the  height  integration,  Eq.  (17)  can  be  rewritten 
as 


T (X) 
aer 


r 

7Tr^Q^(r,  X,  m)n^(r)dr 
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(18) 


where  n (r)  is  the  columnar  size  distribution,  i.e.,  the  momber  of 
c 

aerosols  per  unit  area  per  unit  radius  interval  in  a vertical 

column  through  the  atmosphere.  Thus,  n^(i^)  is  the  unknown  to  be 

determined  through  measurements  of  t (X),  Trr^Q.  (r,  X,  m)  being 

aer  t 

the  known  kernel  function  in  this  case. 


In  obtaining  n^(r),  the  integral  in  Eq.  (10)  is  again  replaced 
by  a summation  over  coarse  intervals  in  r,  each  of  which  is  com- 
posed of  several  subintervals  as  described  in  the  previous  section. 
In  order  to  examine  the  specific  kernel  functions  which  result  if 
that  procedure  is  applied  to  the  present  problem,  let  = 

W(r)f(r)  where  W(r)  is  a rapidly  varying  function  of  r while  f (r) 
is  varying  more  slowly.  With  this  substitution,  Eq.  (18)  becomes 
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where  the  limits  have  been  made  finite  with  r,  = r and  r . = 

1 a q + 1 

r,  r and  f (r)  is  assumed  constant  within  each  coarse  interval.  A 

D 

matrix  equation  of  the  form  of  Eq.  (7)  thus  results  where 


g.  = T (X. ) 
1 aer  i 
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irr^Q^Cr,  m)W(r)dr 


(j  = 1,  2,  ...  q)  (20) 


Writing  Eq.  (20)  as  a quadrature  over  the  subinteirval  from  r^  to 
r^  ^ results  in  an  expression  with  weighting  functions 
Wkj(^kj)  = W(rkj)  Ar . and  are’  the  midpoint  radii  of  each  of 

the  k sTobintervals,.  and  are  employed  in  an  identical  manner  to 
those  previously  described.  The  unknowns  to  be  solved  for  are  the 
f(fj),  one  value  for  each  of  the  larger  intervals. 


The  initial  weighting  functions  were  again  assumed  to  be  a 
Junge  size  distribution  given  by  Eq.  (15) . In  practice,  several 
different  values  of  v*  are  used  to  calculate  the  zeroth  order 
weighting  function  and  the  final  results  after  successive  itera- 
tions are  intercompared . . One  test  of  the  procedure  is  the  simi- 
larity of  the  results  obtained  when  different  values  of  v*  are 

used  for  the  W,  . 

k . 


3 


In  both  inversion  problems  described,  a proper  selection  of 
the  magnitude  of  the  constraint,  3,  is  of  some  importance. 

Although  ideally  the  value  of  3 is  closely  related  to  the  expected 
values  of  the  errors,  E,  in  practice  it  has  been  found  that  a 
selection  of  3 on  this  basis  is  not  always  adequate.  The  procedure 
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used  in  the  work  to  follow  has  been  to  vary  9 through  a broad 

range  of  values  about  its  expected  value  from  an  estimate  of  the 

errors  in  order  to  obtain  a minimum  value,  d , which  yields  an 

mm 

acceptable  solution.  An  acceptable  solution  is  one  in  which  all 
values  of  the  unknown  are  positive  numbers.  Values  of  9 too  small 
normally  result  in  highly  oscillatory  solutions  and  physically 
unacceptable  negative  values  for  some  of  the  f's,  while  too  large 
a value  of  3 obviously  will  cause  the  solution  vector  to  be  domi- 
nated by  the  constraint - 

Figure  9 presents  kernel  functions  7rr^Q^(r,  X,  m)W(r)  for 


RADIUS^  /tm 


Fig.  9.  Kernel  functions  (trr^Q^)  for  four  values  of  X and 

c 

for  m = 1.45-00  i'. 
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Fig.  10^  Modified  kernel  functions  for  the  last  iteration  in 
the  inversion  scheme  resulting  from  the  size  distribution  as  given 
in  the  inset. 

four  values  of  X and  for  m = 1. 45-00  i.  For  this  case  W(r)  = 
cr  +1)  V*  = 3.0.  As  can  be  seen  from  these  kernels,, 

there  apparently  is  information  as  to  particle  number  between  radii 
of  a few  tenths  of  a micrometer  out  to  approximately  1 micrometer 
However,  with  the  present  technique,  successive  iterations  produce 
new  kernels  which  are  the  products  of  the  old  kernels  and  the 
inverted  size  distribution  from  the  previous  iteration.  Figures 
10  and  11  present  kernels  for  the  last  iteration  for  two  actual 
cases.  Figure  10,  for  October  24,  1975,  presents  such  a kernel 
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Fig.  11.  Modified  kernel  functions  for  the  last  iteration  in 
the  inversion  scheme  resulting  from  the  size  distribution  as  given 
in  the  inset. 

where  the  weighting  function  on  the  last  iteration  is  as  given  by 
the  inset  in  the  upper  left-hand  corner.  In  this  case,  the  range 
of  information  content  is  actually  smaller  than  the  starting  ker- 
nel function  (Fig..  9).  However,  this  kernel  is  a result  of  the 
rather  narrow  size  distribution,  log-normal  in  type,  depicted  in 
the  insert,  and  one  would  not  expect  information  over  a broader 
range  of  sizes. 

Figure  11  shows  kernels  resulting  from  a rather  common  type 
of  size  distribution  function  commonly  observed  in  our  work  to 
date.  It  has  the  appearance  of  a Junge  type  of  distribution  with 
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a log-normal  type  superimposed.  The  kernel  functions  resulting 
from  this  type  show  a marked  contrast  to  the  previous  two  figures. 
There  appear  to  be  two  regions  of  maximum  information  content,  one 
for  radii  somewhat  less  than  0.1  ym,  and  another  for  radii  between 
about  0.7  ym  to  about  1.5  ym.  Under  any  conditions,  it  is  apparent 
that  with  the  current  selection  of  wavelengths,  the  solutions  are 
limited  in  information  to  sizes  below  approximately  1 to  1.5  ym. 
Inversion  results  for  sizes  beyond  this  (for  the  case  of  solar 
radiometry)  are  dictated,  for  the  most  part,  by  the  constraints 
that  are  applied. 


III.  RESULTS 

In  this  section,  results  from  actual  data  will  be  presented - 
In  the  case  of  radiometer  results,  large  numbers  of  successful 
inversions  have  been  obtained  from  which  only  a few  typical  ones 
have  been  selected.  A limited  number  of  bistatic  lidar  inversions 
obtained  from  two  cases  are  selected  on  days  when  both  radiometer 
and  lidar  inversions  were  obtained. 

Figure  12  shows  a plot  of  the  measured  aerosol  optical  depth 
as  a function  of  X on  the  left-hand  side  and  the  resulting  inverted 
size  distribution  function  on  the  right.  The  three  different 
curves  are  for  three  different  initial  weighting  functions,  W(r), 
in  each  case  a Junge  distribution  with  exponents  v*  = 3.05,  3.55, 
and  4.05.  The  three  different  solution  are  presented  in  order  to 
demonstrate  the  insensitivity  of  the  final  results  to  the  initial 
guess.  Although  all  cases  do  not  show  this  excellent  agreement,  in 
most  cases  the  agreement  is  considered  to  be  more  than  adequate . In 
general,  the  solution  which  produces  values  of  the  aerosol  optical 
depth  in  best  agreement  with  the  measurements,  in  a least  squares 
sense,  is  the  one  accepted.  Figure  12  demonstrates  a size  distri- 
bution function  fairly  close  to  a Junge  type.  Aerosol  optical 
depths  can  be  seen  to  be  around  0.10  in  the  mid-visible  and  demon- 
strate a nearly  linear  decrease  with  A. 
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Fig.  12.  Left-hand  side  shows  measured  aerosol  optical  depth 
as  a function  of  X while  the  right-hand  side  shows  the  resulting 
inversion  for  aerosol  size  distribution  for  August  13,  1975  data. 


Figure  13  shows  quite  a different  situation-  For --this  case, 

the  aerosol  optical  depth  was  extremely  low,  about  0-03  in  the 

red-visible,  and  shows  a nonlinear  increase  with  X.  This  type 

of  T vs.  X behavior  results  in  the  log-no2rmal  type  of  distri- 
aer 

bution  depicted  in  the  right-hand  side  of  the  figure.  Again,  the 
three  initial  first  guesses  yield  solutions  in  excellent  agree- 
ment. The  sharp  peak  in  this  size  distribution  function  at  about 

r = 0.8  ]im  would  give  use  to  a peak  in  the  x vs.  X curve  at  a 

aer 

value  of  X slightly  greater  than  l.Oym  due  to  the  first  peak  in 

the  Mu  values  for  Q„.  The  measured  values  of  x seem  to  ind±- 

^T  ^ aer 

cate  a peak  would  be  reached  in  this  region  if  measurements  were 
extended  out  to  longer  wavelengths. 

Figure  14  shows  yet  another  quite  common  size  distribution 

function  commonly  observed  over  Tucson,  Arizona.  It  appears  to 

be  a Junge  type'* with  a log-normal  type  superimposed.  The  x vs. 

aer 
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Left-hand  side  shows  measured  aerosol  optical  depth 


as  a function  of  X while  the  right-hand  side  shows  the  resulting 
inversion  for  aerosol  size  distribution  for  October  24,  1975. 


Fig.  14.  Left-hand  side  shows  measured  aerosol  optical  depth 
as  a function  of  X while  the  right-hand  side  shows  the  resulting 
inversion  for  aerosol  size  distribution  for  November  20,  1975. 


X measurements  which  give  rise  to  this  type  of  distribution  are 
curved  in  an  opposite  sense  to  those  of  Figure  13,  as  can  be  seen 
from  the  figure-  The  three  initial  guesses  again  yield  satisfactory 
agreement . 

Figure  15  shows  a series  of  size  distribution  functions  as 
determined  on  four  successive  days.  Although  the  values  of  t 

aer. 

were  quite  different  on  each  of  the  days,  they  all  exhibit  a simi- 
lar shape,  consistent  with  the  x vs.  X curves  shown  on  the  left- 

aer 

hand  side  of  the  figure. 

Figure  16  demonstrates  another  series  of  three  days.  The 
first  day  of  this  series  was  extremely  clear  with  very  low  aerosol 
optical  depths  and  was  followed  by  a mild  episode  of  blowing  dust 
on  the  second  day  which  resulted  in  aerosol  optical  depths  and  was 
followed  by  a mild  episode  of  blowing  dust  on  the  second  day  which 
resulted  in  aerosol  optical  depths  of  about  0.15  to  0.20.  On  the 


Fig.  15-  Measured  aerosol  optical  depths  and  resulting  inver- 
sions for  aerosol  size  distribution  on  four  successive  days. 
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Fig.  16.  Measured  aerosol  optical  depths  and  resulting  inver 
sions  for  aerosol  size  distribution  for  a three-day  period  in 
which  a dust  storm  occurred  on  the  second  day. 

third  day  the  optical  depth  recovered  to  almost  its  previous  level 
but  its  variation  with  X was  considerably  different.  The  size 
distributions  resulting  from  inverting  these  three  days  are  quite 
interesting.  On  the  first  day,  prior  to  the  blowing  dust  episode, 
a fairly  common  size  distribution  was  in  evidence,  a Junge  type 
with  a log-normal  superimposed.  During  the  blowing  dust  episode, 
this  changes  to  a very  broad,  log-normal  type  with  a peak  at  a few 
tenths  of  a micrometer.  On  the  third  day,  the  size  distribution 
function  is  again  log-normal  in  type,  but  much  narrower  and  with  a 
peak  at  about  0.6  um. 

There  is  undoubtedly  much  information  to  be  gleaned  from  con- 
tinued studies  of  these  size  distribution  functions  and  their 
variations  with  meteorological  conditions  and  their  evolution  with 
time.  These  results  are  presented  here  only  to  indicate  the  type 
of  information  which  may  be  obtained  from  inversions  such  as  these 
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Fig.  17.  Inversion  for  aerosol  size  distribution  on  the  same 
day  as  obtained  from  bistatic  lidar  and  solar  radiometer. 

Finally,  in  Figs.  17  and  18,  inverted  aerosol  size  distribution 
functions  are  presented  for  two  days  comparing  results  obtained 
from  solar  radiometer  data  with  those  obtained  from  bistatic  lidar 
data.  In  Fig.  17,  both  inversions  resulted  in  size  distribution 
functions  similar  to  the  Junge -log-normal  types  presented  earlier. 
While  the  agreement  between  the  two  sets  of  data  is  not  perfect,  it 
is  felt  that  they  are  similar  enough  to  lend  support  as  to  the 
reliability  of  this  work.  In  Fig.  18,  in  addition  to  the  lidar  and 
radiometer  inversions,  a third  distribution  is  presented.  This  dis- 
tribution represents  the  besvt  fit  to  the  optical  depth  measurements 
that  could  be  obtained  from  a two-slope  size  distributibn  function, 
the  details  of  which  need  not  be  given  here.  In  this  case,  all 
three  methods  yield  almost  identical  size  distribution  functions, 
which  on  this  particular  day  is  a "modified"  Junge  type. 
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Fig,  18,  Inversion  for  aerosol  size  distribution  on  the  same 
day  as  obtained  from  blstatlc  Ildar  and  solar  radiometer  except  for 
a different  day  and  as  a third,  best  fit,  distribution  also. 


The  previous  examples  are  presented  in  order  to  demonstrate 
the  potential  utility  of  remote  sensing  and  inversion  techniques. 
Aerosol  size  distribution  functions  are,  at  best,  difficult  to 
measure  directly.  Furthermore,  direct  measurement  methods  are  gen- 
erally much  more  time  consuming  and  costly,  factors  which  render 
more  or  less  continuous  monitoring  almost  impossible.  With  the 
remote  sensing  techniques  described  here,  continuous  monitoring, 
weather  conditions  permitting,  is  almost  as  easy  as,  say,  monitoring 
solar  radiation.  It  is  felt  that  with  the  greater  abundance  of  data 
which  is  available  through  these  techniques,  it  will  be  possible  to 
learn  more  about  the  properties  of  atmospheric  aerosols  in  a shorter 
time  than  has  heretofore  been  possible. 
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DISCUSSION 


Pearce:  That  dust  stom  graph — I was  sort  of  surprised  to  see  the 

small  particles  having  been  swept  away  by  the  storm.  Do  you  have 
a lot  of  confidence  that  that  is  a real  effect? 

Herman:  This  particular  dust  storm  was  not  locally  generated  as 

you  can  tell  by  the  optical  depth.  The  most  common  dust  storms  in 
Arizona  are  those  generated  in  the  California  desert  and  they  blow 
in  from  there.  We  do  on  occasion  get  locally-generated  dust  but 
on  this  particular  case  the  optical  depth  was  about  0.15  which  indi- 
cated that  it  was  from  a distant  source — relatively  distant  as 
opposed  to  blown  up  locally.  The  only  explanation  that  one  could 
give  for  it  is  that  some  of  the  large  particles  have  fallen  out 
and  some  of  the  smaller  ones  had  coagulated.  The  drop  off  here  I 
can't  explain  other  than  that's  what  we  got.  Now,  it  may  be  just 
a poor  inversion;  I don't  know,  but  we  tend  to  believe  it  until  -we 
find  reason  not  to.  On  the  other  side,  however,  there  are  just 
larger  niamber  densities  but  the  shape  from  the  peak  on  over  is 
exactly  what  we  get  typically  from  which  we  think  is  locally  gen- 
erated dust.  What  our  feeling  is  that  this  is  a dust  storm  from  a 
distant  source,  but  I can't  explain  why  the  drop  off  on  the  left. 

But  on  the  right  I think  it  is  just  simply  a case  of  most  of  the 
large  particles  have  fallen  out  and  that  is  just  what  is  left  over. 

Twitty:  My  question  is  really  the  same  thing,  because  that  really 

bothers  me.  Current  theories  say  that  the  large  particles  are, 
in  fact,  locally  generated  dust  or  locally  generated  surface  sources 
of  some  kind  and  the  small  particles  are  of  a different  origin  and 
that  the  two  are  more  or  less  independent.  So  we  expect  this  other 
bump  just  to  add  on  to  the  one  that  is  already  there.  So  the  ques- 
tion is,  what  is  your  source  of  that  small  particle  set?  How  do 
you  account  for  the  Rayleigh  scattering? 

Herman:  We  have  a calibrated'  lidar. 

Twitty:  It  is  not  lidar  though. 

Herman:  Excuse  me.  Okay,  this  is  not  the  comparison.  The  cali- 

bration on  the  radiometer  is  done... I can't  tell  you  right  now. 

I can't  remember  how  it  is^done.  The  radiometer,  however,  is  cali- 
brated to  where  we  know  the  signal  due  to  the  Rayleigh  amount.  It 
is  calibrated  so  that  we  can  subtract  out  the  Rayleigh  scattering 
from  it.  The  lidar  is  calibrated  with  a pulse  tq  pulse  calibration, 
which  gives  us  a monitor  of  the  power  in  each  pulse  and  then  we 
calibrate  it  at  the  highest  level  for  the  Rayleigh  calibration. 

Twitty:  Now  what  really  bothers  me,  you  go  back  to  your  weighting 

function  for  that  inversion.  I don't  know  whether  you  can  find 
that  easily. 


SOX) 


Herman:  I don't  think  I have  one  for  that  day. 

Twitty:  No  but  you  have  one,.. 

Herman:  That  type  of  size  distribution? 

Twitty:  That  is  similar.. 

Herman:  The  one  with  a large  amount  of  particles?  (showing  graph^) 

Twitty:  Yes,  And  the  effect  which  of  course  produces  the  rapidly 

decreasing  distribution  causes  the  number  1/  which  is  the  short 
wavelength^  2,  3,  4f  to  decrease  very  rapidly  in  the  weighting 
function.  It  could  be  explained  if  you  are  not  subtracting 
sufficient  amount  of  Rayleigh.  That's  the  equivalent  to  if  you 
have  a little  bit  less  Rayleigh  at  number  1,  which  is  0.44  ym^, 
then  that  would  bring  that  whole  signal  down. 

Herman:  Oh,  yes,  improper  calibration  could  cause  this  to  happen. 

Twitty:  It  could  have  a very  great  effect  on  that  part. 

Herman:  I don't  argue  that  point.  It  is  certainly  true.  We  feel 

we  have  the  best  possible  calibration  that  can  be  done  on  it. 

Now  I am  not  saying  this  is  gospel.  This  is  the  result  of  what 
we  have  been  getting.  We  are  aware  of  these  problems — calibration 
problems  and  so  forth,  and  any  of  these  things  can  throw  this 
inversion  or  any  inversion  off.  And  you  are  absolutely  right. 

But  then  this  would  have  been  the  case  on  the  other  days  when  we 
didn't  get  the  small  particles. 

Twitty:  Not  if  on  those  days  you  have  substantially  greater 

optical  depth  so  that  it  essentially  swamped  out  that  day. 

Herman:  I don't  know  what  the  optical  depth  was  on  this  day.  I 

don't  think  it  was  listed.  But  we  cain  find  it  though — at  least 
we  can  find  comparisons.  They  are  not  that  much  different.  The 
worst  optical  depth  case  we  presented  was  0.15;  most  of  them  are 
in  the  0.05  to  0.1.  Now  this  case  was  not  a 0,3  or  0.4  optical 
depth  case, 

Twitty:  That  case  was  a very  thin  optical  depth  case,  right? 

Herman:  This  was  thin,  but  the  dust  case  was  about  0.15,  Now 

when  we  have  locally  generated  dust  storms  we  get  up  to  0.3  or 
0.4  or  higher.  The  0.15  is  moderately  hazy.  It  is  not  extreme. 

I wouldn't  call  that  perturbed. 


^See  Fig,  11  in  this  paper. 
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Twitty:  That  is  still  consistent  with  my  concern  about  the  cali- 

bration. And  I might  make  a comment  about  these  kinds  of  plots, 
because  I think  those  kinds  of  plots  are  really  excellent.  I 
looked  at  those  a lot  when  I was  doing  inversions,  because  your 
weighting  functions  like  that  are  really  your  distribution  of  the 
signal  with  respect  to  the  size  you  are  trying  to  determine.  And 
you  can  pick  right  off  of  that  where  you  clearly  have  no  infor- 
mation with  respect  to  size  in  the  final  distribution  as  you  have 
no  signal  coming  from  sizes  of  that  class.  It  comes  right  out  of 
the  result  of  the  inversion. 

Herman:  I think  that  when  we  get  below  certainly  a few  tenths  of 

a micron  and  above  about  2 microns,  as  I said  earlier,  I think 
there  are  certainly  questions  here.  But  I do  think  we  have  infor- 
mation between  a few  tenths  of  a micrometer  and  one  or  two  micro- 
meters. It  is  my  feeling  that  there  is  some  truth  to  these  inver- 
sions over  that  size  range.  There  is  no  way  for  anybody  to  say  is 
it  right  or  wrong.  The  only  way  to  ever  prove  this  is  to  make  simul- 
taneous measurements  by  several  techniques  or  go  up  and  sample  it 
directly  the  same  time  you  make  an  inversion.  We  all  know  the 
limitations  to  this. 

Twomey:  1 would  just  like  to  comment  that  under  the  conditions  of 

blown  dust  in  Arizona  when  your  visibility,  say,  in  the. Catalina 
Mountains  and  such,  is  grossly  poorer  than  what  is  normally  some- 
times total,  the  particle  concentration  as  measured  by  a particle 
counter  invariably  in  my  experience  is  low.  Now  that’s  quite  in 
agreement  with  what  you  show  up  there  with  reduced  number  of  small 
particles.  And,  in  aerosol  physics  this  is  a very  ancient  result. 
There  was  a gentleman,  back  in  1916  who  beat  a carpet  in  a room  for 
half  an  hour  and  was  very  surprised  to  find  that  his  total  particle 
concentration  was  less  at  the  end  of  this  episode.  His  lungs  were 
worse  I But  the  total  particle  count  went  down. 

Barks trom:  I have  a question  that  is  related  to  the  thing  that 

Jerry  Twitty  asked.  First  of  all,  I’d  like  to  know  if  this  is 
what  Dr.  Rodgers  would  classify  as  a messy  nonlinear  problem, 
you  know,  the  worst  kind?  And,  if  so,  how  we  can  go  about  cal- 
culating an  error  bar  on  each  of  those  inverted  data? 

Herman:  I didn’t  show  these.  We  have  done  error  bars  on  these - 

We  have  error  bars  on  the  optical  depth  determination  and  then  we 
have  error  bars,  at  least  a crude  estimation,  on  the  size  distri- 
bution, which  I didn't  show  here. 

Barkstrom:  Yes,  but  in  light  of  the  nonlinear  problem  associated 

with  the  calibration,  is  there  a problem  connected  with  the  error 
bars  that  you  get  out  of  the  linear  estimate? 

Herman:  Who  are  you  asking? 
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Barkstrom:  I*d  like  to  sort  of  ask  Dr,  Rodgers  as  well, 

Rodgers:  I was  going  to  ask  you  about  this  one  as  wellj  It 

wasn't  very  clear  to  me  whether  you  had  a linear  problem  or  a non- 
linear problem. 

Herman:  This  is  a linear  problem  but  we  are  iterating  because  we 

are  trying  to  bootstrap  up  our  weighting  function  guess, 

Rodgers:  Well,  if  you  have  to  change  the  weighting  function,  then 

it  is  not  a linear  problem, 

Herman:  We  are  making  a first  guess  to  perform  that  integral  of 

the  kernel  which  we  can't  do  analytically.  We  have  to  make  some 
type  of  weighting  function  guess  to  provide  a proper  integration 
over  the  kernel  function.  Otherwise,  if  we  use  just  large  inter- 
vals, we  have  a very  poor  representation  of  the  kernel. 

Rodgers:  The  equations  are  linear? 

Herman:  The  equation • itself , if  we  could  do  the  integral  properly, 

would  be  linear.  We  could  do  it  in  one  iteration. 

Rodgers:  Yes,  so  the  problem  is  one  of  integrating  over  a peculiar 
kernel? 

Herman:  That  is  the  basic  problem  in  particularly  the  lidar  case 

but  also  in  the  radiometer. 

Rodgers:  I still  don’t  really  understand  how  it  comes  out  non- 

linear then? 

Herman;  It  is  not  a nonlinear  problem.  We  are  iterating  simply 
to  get  a better  estimate  of  the  integral  over  the  kernel  function. 

We  could  get  the  answer  in  the  first  iteration  if  we  wanted,  but 
we  are  getting  a better  estimate  as  we  iterate  on.  I don’t  think 
it  is  equivalent  to  making  it  nonlinear  . I don't  think  you  can 
even  consider  it  equivalent  to  nonlinear. 

Rodgers:  Well,  if  it’s  linear,  you  don’t  need  to  iterate.  I just 

don’t  understand  what  the  problem  is. 

Herman:  Well,  I thought  I had  explained  the  technique  we  are  using. 

As  I say,  we  do  not  have  to  iterate.  It  is  not  necessary. 
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THE  INVERSION  OF  STRATOSPHERIC  AEROSOL  AND 
OZONE  VERTICAL  PROFILES  FROM  SPACECRAFT 
SOLAR  EXTINCTION  MEASUREMENTS 


William  P.  Chu 
Old  Dominion  University 


This  paper  analyzes  the  inversions  of  multi-^channel 
solar  extinction  measurements  in  the  0.35-1 »0  ym  wavelength 
region  to  retrieve  stratospheric  aerosol  and  ozone  vertical 
profiles  using  both  the  constrained  linear  inversion  scheme 
and  the  iterative  scheme  recently  developed  by  Twomey • 

The  inversions  of  the  multi-wavelength  solar  extinction  data 
obtained  from  spacecraft  have  been  analyzed  based  on  the 
inversion  of  computer  simulated  data  using  various  atmospheric 
models  with  differing  amounts  of  aerosol  and  ozone  in  the 
stratosphere.  The  sensitivities  of  the  inversion  schemes  to 
different  experimental  errors  are  discussed  in  terms  of 
accuracy  and  resolution  of  the  retrieved  profiles. 

1.  INTRODUCTION 

The  minor  constituents  in  the  stratosphere  have  received 
great  attention  in  recent  years  because  of  concerns  over  their 
roles  in  affecting  the  global  environment.  With  mankind's 
increasing  activities  in  the  stratosphere r such  as  supersonic 
transport  (SST)  and  Shuttle  missions,  and  ground-based  activities, 
such  as  those  involved  with  nitrogen  fertilizers,  an  increase  in 
stratospheric  aerosol  loading  together  with  the  removal  of  strato- 
spheric ozone  are  possible  through  various  chemical  reactions.  A 
realistic  assessment  on  this  problem  is  still  not  possible  due  to 
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the  lack  of  information  on  the  distributions  of  most  of  the  con- 
stituents in  our  atmosphere. 

This  paper  analyzes  one  of  the  atmospheric  remote  sensing 
methods  using  spacecraft  solar  extinction  technique  to  retrieve 
stratospheric  aerosol  and  ozone  vertical  profiles.  In  this  remote 
sensing  technique,  the  spacecraft  instrument  will  direct  toward 
the  horizon  and  measure  transmitted  solar  radiant  intensity  in  a 
number  of  spectral  intervals  with  wavelengths  from  0.38  to  1.00  ym. 
The  locations  of  the  spectral  channels  are  selected  based  on  con- 
siderations of  maximum  extinction  contribution  from  each  individual 
constituent,  interference  from  other  constituents,  and  available 
energy  for  measurements.  The  radiant  intensity  data  are  then 
inverted  to  produce  vertical  profiles  for  each  of  the  constituents. 
The  inversion  procedure  can  be  separated  into  two  steps.  The 
first  step  involved  retrieval  of  the  total  vertical  extinction  pro- 
files at  each  spectral  interval.  The  second  step  then  separated 
the  contributions  from  each  constituent  using  the  multi-spectral 
extinction  data  at  each  altitude  level.  Examples  of  inverted 
results  from  simulated  data,  including  experimental  errors,  are 
presented  and  discussed  in  this  paper. 

II.  EXPERIMENTAL  CONCEPT 

In  the  solar  extinction  experiment,  multi -wave length  measure- 
ments of  the  transmitted  solar  radiant  intensity  along  the  line  of 
sight  through  the  atmosphere  are  made  on  board  the  spacecraft. 

The  basic  geometry  is  illustrated  in  Fig.  1.  As  the  spacecraft 
emerges  from  Earth's  shadow,  the  radiometer  on  board  the  spacecraft 
will  point  to  the  Sun , measuring  solar  radiant  intensity  at 
several  pre-selected  wavelength  regions.  In  this  paper,  we  will 
consider  the  radiometer  with  a small  field  of  view  in  order  to 
achieve  higher  vertical  resolution.  The  radiometer  is  assximed  to 
either  lock  on  a fixed  position  on  the  solar  disk  or  scan  across 
the  solar  disk  for  data  acquisition. 
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SOLAR  OCCOLTATION  G3EOMETRY 

I ^ = T ( hj  (S)  1 1$ 

LAMBERT -BEER  LAW  oo 

T(hT)  = EXP  {-  2 / P (h)  dP(h)} 

y3(hi  - EXTINCTION  PROFILE 
P(ht  - REFRACTED  OPTICAL  PATH 

Fig.  I.  Geometric  of  the  solar  extinction  experiment 


The  monochromatic  radiant  intensity  reaching  the  radiometer 
is  given  by  the  fundamental  equation  of  radiative  transfer  (Ref.  1) 


=^X<“z> 


=^X'V"X<>‘Z-V  * I 


1 


J, (h)dx. (h  ,h  ) 
A A Z o 


(1) 


T, (h  ,h  ) 
A Z O 


Equation  (1)  relates  at  wavelength  X,  the  radiant  intensity 

I, (h  ) measured  at  distance  h , to  the  radiant  intensity  I, (h  ) at 
A Z Z A O 

boundary  h , the  monochromatic  transmittance  t,  (h  ,h  \ and  source 
o A z o 

function  3-t  distance  h along  path  from  h^  to  h^. 

For  solar  extinction  experiment  with  spectral  channels  closed 
to  the  visible  region,  the  contribution  by  atmospheric  emission  to 
the  radiant  intensity  is  very  small  and  can  safely  be  neglected. 
The  radiative  transfer  equation  is  thus  greatly  simplified  to 

'X<"z>  = <"> 
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where  the  radiant  intensity  I, (h  ) is  now  the  solar  radiant  inten- 

A O 

sity  and  t. (h  ,h  ) is  the  transmittance. of  the  atmosphere  between 
A z o 

the  Sun  and  the  spacecraft. 

In  the  wavelength  range  from  0.3  to  1.0  micron,  atmospheric 
attenuation  are  predominantly  caused  by  aerosol,  ozone,  and  air 
molecules  (Rayleigh  component)  with  minor  contributions  from 
water  vapor  and  molecules.  Models  of  the  stratospheric  extinc- 
tion as  a function  of  wavelength  profile  can  be  constructed  from 
different  vertical  distributions  of  aerosol  and  ozone.  Figure  2 
shows  an  extinction  coefficient  model  as  a function  of  wavelength 
at  an  altitude  of  18  km.  Notice  that  at  most  of  the  wavelength 
shown,  the  total  extinction  consists  of  contributions  from  aerosol, 
ozone,  and  Rayleigh  components  with  different  weights.  The 
weightings  will  also  be  different  for  different  altitude  levels, 
depending  on  the  distributions  of  the  different  constituents. 


Fig.  2.  :^xtinction  as  a function  of  wavelength  model  with 
an  altitude  of  18  km. 
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If  we  made  the  assumptions  that  the  atmosphere. is  spherically 

stratified  and  spherically  symmetric , the  transmittance  function 

T-  (h  ,h  ) will  then  be  given  by  the  Lambert-Beer  law  as 
A z o 


T.(h  ,h  ) = exp{-  2 /“  3.(h)dP(h,h J } 

A Z O J^iip  A J. 


(3) 


where  3, (h)  is  the  atmospheric  extinction  coefficient  at  wavelength 
A 

A as  a function  of  vertical  height  h,  and  P(h,h^)  is  the  optical 
path  at  height  h for  line  of  sight  with  tangent  height  h^. 
Equivalently,  the  optical  depth  9^0^^)  can  be  defined  as 

9x(\)  = 


- 2 r 6,  (h)dP(h,h  ) (4) 

T 

The  determination  of  vertical  extinction  profile  at  each  wavelength 
A required  the  inversion  of  the  integral  equation  in  Eq.  (4) . 


Ill . INVERSION  TECHNIQUES 

A.  Inversion  of  Measured  Transmittcince  to  Total  Extinction  Pro- 
files 

In  Eq.  (4) , the  measured  parameter  g (h_)  is  equal  to  a kind 

A T 

of  convolution  of  the  weighting  function  9P(h,h^)/8h  with  the 

xinknown  vertical  extinction  profile  3.  (h)  . The  integral  can  be 

A 

approximated  with  a sum  over  n discrete  atmospheric  layers  with 
equal  thickness  and  assigning  to  each  layer  an  averaged  extinction 
coefficient  3^.  The  integral  equation  can  then  be  replaced  by  a 
system  of  linear  equations 
n 

= I e.  i = (5) 

j = i 3 J 

where  P^^  is  the  optical  path  length  in  the  jth  layer  with  tcingent 
height  at  the  ith  layer.  Equation  (5)  can  be  abbreviated  into 


matrix  form 


where  g is  the  colviinn  vector  for  the  measured  optical  depth,  3 is 
the  column  vector  for  the  unknown  extinction  profile,  and  P is  the 
optical  path  length  matrix. 

Equation  (6)  can  be  solved  directly  for  3 either  exactly  when 

y 

n = m or  in  the  least  square  sense  when  n < m 


e = P 


n = m 


(7a) 


ip  -1  m 

3=(PP)  Pg  n<m 


(7b) 


-1  T 

where  P is  the  inverse  of  P and  P is  the  transpose  of  P.  The 
solutions  obtained  from  the  direct  inversion  are  generally  unstable 
due  to  the  presence  of  noise  associated  with  the  measured  parameter 
g.  In  order  to  suppress  the  unphysical  oscillations  in  the 
inverted  solutions,  some  constraints  on  the  high  frequency  com- 
ponents of  the  solutions  have  to  be  incorporated  in  the  inversion 
schemes.  We  have  analyzed  two  different  inversion  methods.  The 
first  method  is  the  linear  constrained  inversion  as  developed  by 
Twomey  (Ref.  2)  and  Phillips  (Ref.  3);  the  second  method  is  the 
iterative  scheme  recently  developed  by  Twomey  (Ref.  4) . We  will 
discuss  the  two  methods  separately. 

I.  Linear  Constraint  Inversion 


The  solution  to  Eq.  (6)  is  given  by  the  following  expression 


T 

3 = (P  P + yH) 


P g 


(8) 


where  normally  H is  the  constrained  matrix  and  y is  a constant 
smoothing  parameter.  In  our  analysis,  we  used  the  constrained 
matrix  H which  minimizes  the  second  difference  of  the  solution. 
The  smoothing  parameter  y whose  magnitude  is  proportional  to  the 
noise  level  of  the  measurements  has  been  replaced  by  a diagonal 
matrix  of  element  The  reason  for  using  a variable  smoothing 

parameter  y arises  from  the  nonlinear  expression  for  the  noise 
term  in  Eq.  (4) . Assuming  the  radiometer  has  a noise  level  e^ 
associated  with  each  radi^t  measurement  Ij^,  the  deduced 
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I. 


transmittance  will  have  approximately  the  same  noise  level  e^^. 
By  computing  the  optical  depth  we  have 

g^  = to{l/(x^  + e^) } (9) 

for  <<  we  can  expand  Eq.  (9)  and  obtain  . 


~ £n{l/T^}  - 


(10) 


The  second  term  on  the  right  side  is  the  equivalent  noise 
level  for  the  parameter  g^.  Thus,  we  choose  the  matrix  element 


^ii 


as 


r e. 


(ID 


where  y is  a constant  parameter  to  be  adjusted  for  optimum 
o 

inverted  solutions . 


2.  Iterative  Inversion 


The  iterative  method  starts  by  initially  assuming  a guessed 
solution,  and  the  solution  is  continuously  updated  through  the 
following  iteration: 


(k  + 1)  ^ g(k)  (j.(k)  _ i)K.  .3^^^ 

D ] X 13  D 


(12) 


(k)  (k) 

where  B . is  the  solution  3 . at  k-iteration , r . is  the  ratio 
D 3 1 

of  measured  g^  to  the  computed  g^  from  the  k-iterated  solution, 
and  K.  . is  the  normalized  weighting  fxinction  9P(h,h„)/9h  where 

ID  ip 


< 1.  The  iterative  process  is  terminated  when  the  computed 
radiant  approaches  the  measured  radiant  to  within  the  instrument 
noise  level. 


B.  Inversion  of  Multi-wavelength  Extinction  Profiles  Into 
Constituent  Profiles 

The  inversion  methods  discussed  in  (A)  will  generate  multi- 
spectral  extinction  vertical  profiles  from  the  multi-channel 
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extinction  measurements.  The  second  step  in  the  inversion  process 
will  be  to  separate  the  contributions  from  individual  constituent. 
For  the  wavelength  region  of  interest,  the  total  extinction 
coefficient  at  a fixed  altitude  level  can  be  written  as  the  sum  of 
contributions  from  aerosol,  ozone,  and  air  molecules  as 


3^  ^ = 3 , (A)  + a,  . 3 , (A  ) + b,  . 3 (A  ) 

total  aerosol  Ai  mol  o Ai  o o 

3 . 


(13) 


where  3 ^ (A  ) is  the  Rayleigh  extinction  coefficient  at  a refer- 

mol  o 

ence  wavelength  A^,  a^^  is  the  wavelength  dependence  of  the 
Rayleigh  extinction,  is  the  wavelength  dependence  of  the  ozone 

absorption  profile,  and  aerosol  extinction  as 

a function  of  wavelength  A.  In  this  study,  we  have  assumed  a 
two-parameter  model  for  the  description  of  the  aerosol  wavelength 
dependent  extinction  coefficient  as 


3 


aerosol 


(X)  = A x“ 


(14) 


where  A and  a are  the  two  parameters.  Substituting  Eq.  (14)  into 

Eq.  (13) , we  then  have  a system  of  nonlinear  equations  with  unknown 

3 n(^)/3  (A),  A,  and  a for  each  altitude  level.  ^^The  system 

mol  o o o 

of  nonlinear  equations  can  be  solved  using  the  Marquardt  (Ref.  5) 
algorithm.  The  Marquardt  algorithm  is  a minimijm  search  procedure 
for  a system  of  nonlinear  equations.  An  initial  set  of  guessed 
solutions  for  the  unknowns  is  updated  through  the  following 
iterative  equation 


x"'*  ‘>.X« 


f\j  ~ ~ 


1-rT/  k.  , k. 
J (x  )P(x  ) 


(15) 


where  JC  is  the  solution  vector  for  the  unknowns  at  the  k- 

k * 

iteration,  F(x  ) is  the  Residue  vector  for  Eq.  (13)  at  the  k- 

iteration,  J(x  ) is  the  Jacobian  matrix  with  element  J. , = 3f ./3x. 

^ ID  1 D 

where  f^  is  the  ith  nonlinear  equation.  I is  the  identity  matrix, 
cind  is  a control  parameter  at  the  k-iteration.  The  control 
parameter  A^^  is  adjusted  at  each  iteration  for  fast  convergence  to 
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the  miniimam.  The  iterative  process  is  terminated  when  the  dif- 
ference between  two  consecutive  residue  squares  is  less  than  0.1%. 


IV.  INVERSION  RESULTS 

Inversion  results  have  been  obtained  from  applying  the 
inversion  techniques  discussed  earlier  to  simulated  spacecraft 
solar  extinction  measurements-  Several  models  of  the  aerosol  and 
ozone  profiles  were  used  to  generate  atmospheric  transmittance  pro- 
files in  different  wavelength  regions.  Wavelengths  at  0-38,  0.45, 
0.6,  and  1.0  micron  were  chosen  to  coincide  with  the  four  spectral 
channel  locations  for  the  Stratospheric  Aerosol  and  Gas  Experiment 
(SAGE) . The  simulated  radiant  intensity  data  are  generated  from 
the  atmospheric  transmittance  profiles  by  incorporating  the  space- 
craft geometry.  The  radiometer  is  assumed  to  have  a field  of  view 
of  30  arc  seconds,  when  situated  on  a satellite  at  an  orbital  alti- 
tude of  600  km.  A total  of  80  radiant  intensity  data  points  are 
generated  per  channel  covering  tangent  altitudes  from  10  to  50  km 
in  equal  increments  of  0.5-km  height.  Each  data  point  is  simulated 
by  assuming  that  the  radiometer  is  pointing  to  a different  location 
on  the  solar  disk. 

The  inversion  of  these  radiant  profiles  was  then  performed  by 
introducing  experimental  errors  of  various  origin  and  magnitudes 
to  the  simulated  data.  There  are  three  types  of  erros  which 
directly  affect  the  inversion  results.  The  first  type  of  error 
is  the  bias  error  in  the  determination  of  the  exact  tangent 
heights  for  all  the  data  points.  The  second  type  of  error  is  the 
noise  associated  with  each  measurement.  The  third  type  of  error 
is  associated  with  the  uncertainty  in  knowing  the  pointing  angle 
of  the  radiometer  for  each  data  point.  This  pointing  error  will 
result  in  both  an  erroneous  value  for  the  unattenuated  solar 
radiance  due  to  solar  limb  darkening  effect  and  an  erroneous  value 
for  the  instantaneous  tangent  height - 
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The  three  types  of  errors  mentioned  earlier  in  different  mag^ 
nitudes  were  added  to  the  simulated  radiant  data  and  inversions 
were  performed  to  determine  the  sensitivity  of  these  errors  on  the 
inversion  results » The  atmosphere  is  divided  into  45  homogeneous 
layers  of  0.5-km  thic5cness  between  10  and  21  km,  and  increasing 
thicknesses  from  21  to  50  km.  The  path  length  matrix  is  computed 
by  using  a ray  trace  calculation  to  include  atmospheric  refraction 
effect. 

Figures  3 to  6 show  inversion  results  for  simulated  extinc- 
tion measurements  using  the  linear  constrained  inversion  method. 
The  input  aerosol  vertical  profile  used  is  deduced  from  an  actual 
lidar  backscattered  profile  with  vertical  resolution  of  0.15  km. 
The  ozone  vertical  profile  used  is  similar  to  Elterman's  model 
(Ref.  6)  except  that  artificial  layering  structures  were  added. 

The  Rayleigh  extinction  profile  is  also  deduced  from  an  actual 
rawinsonde  temperature  and  pressiire  profiles.  In  these  simulated 
measurements,  experimental  errors,  including  noise  levels  of  0.1% 
to  full  scale  radiant  intensity  and  pointing  iincertainty  of  3 arc 
seconds,  were  added  to  the  simulated  measurements.  The  inversion 
results  are  presented  in  terms  of  aerosol  vertical  extinction  at 
1.0  ym  (Fig.  3),  ozone  vertical,  extinction  at  0.6  ym  (Fig.  4), 
Rayleigh  vertical  extinction  at  0.38  ym  (Fig.  5),  and  aerosol 
optical  model  parameters  (Fig.  6) . The  aerosol  optical  model 
parameter  a is  assumed  to  be  -0.8  between  17  and  20  km  and  equal 
to  -1.6  for  the  rest  of  the  altitude  range.  Inspections  of  Figs. 

3 to  6 indicate  that  both  aerosol  and  ozone  vertical  profiles  can 
be  inverted  with  good  accuracy  up  to  an  altitude  of  40  km.  The 
poor  inversion  results  for  the  Rayleigh  profile  below  14  km  are 
caused  by  the  rapid  decreasing  signal  level  at  the  0.38  ym  chan- 
nel. In  Fig.  6,  the  inverted  results  for  the  aerosol  optical 
parameter  a within  the  Junge  layer  region  show  some  fluctuation 
and  no  inversion  results  can  be  obtained  above  20  km  due  to  the 
rapid  decrease  of  aerosol  concentration  in  this  region. 
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Molecular  extinction,  km 


Fig.  5.  Inversion  results  for  Rayleigh  vertical  extinction 
profile  at  0,38  lim. 


-Exponent 


Fig.  6.  Inversion  results  for  aerosol  optical  model 
parameter  a as. a function  of  altitudes. 


Figiires  7 to  9 show  inversion  results  for  the  same  input 
models  except  that  the  noise  level  for  each  measurement  has  been 
increased  to  l.d%  and  the  pointing  uncertainty  has  been  increased 
to  15  arc  seconds.  Both  the  inverted  results  for  aerosol  and 
ozone  profiles  show  consideraible  decrease  in  resolution.  This 
behavior  is  consistent  with  the  inversion  technique  in  which  the 
smoothing  parameter  in  Eq..  (11)  is  increased  to  accommodate 
the  high  noise  level  in  the  measurements.  The  inversion  results 
for  the  aerosol  optical  model  parameter  which  are  not  shown  here 
are  similar  to  Fig.  6 except  larger  fluctuations  were  observed. 

Figures  10  to  11  show  inversion  results  for  a set  of  dif- 
ferent aerosol  and  ozone  vertical  profiles.  The  aerosol  profile 
is  deduced  from  the  first  published  lidar  observation  of  the 
Volcano  de  Fuego  eruption  (Ref-  7)  showing  sharp  layering  struc- 
tures. The  peak  layer  at  19  km  is  approximately  0.5  km  wide.  The 
inversion  cannot  reproduce  this  sharp  layering  structure  due  to 
the  integrating  effect  over  the  radiometer's  field  of  view. 


Fig.  7.  Inversion  results  for  aerosol  vertical  extinction 
profile  with  increased  experimental  ’errors. 
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Fig.  10.  Inversion  results  for  aerosol  vertical  extinction 
profile  with  different  aerosol  vertical  extinction  profile. 


Fig.  11.  Inversion  results  for  ozone  vertical  extinction 
profile  with  different  ozone  vertical  extinction  profile. 


Fj9'.  12.  Inversion  results  for  aerosol  vertical  extinction 
profile  using  the  iterative  inversion  scheme. 


Fig.  13.  Inversion  results  for  ozone  vertical  extinction 
profile  using  the  iterative  inversion  scheme. 
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Figures  12  and  13  show  the  inversion  results  using  the  itera- 
tive method  as  discussed  in  Eg.  (12) . The  atmospheric  model  and 
experimental  noise  levels  are  identical  to  those  as  shown  in 
Figs.  3 to  6.  Comparison  of  the  inversion  results  in  this  case 
to  those  as  shown  in  Figs.  3 and  4 indicates  that  they  are  quite 
similar.  The  inverted  aerosol  profile  at  altitude  above  30  km  in 
this  case  shows  some  large  amplitude  oscillations.  This  is  expec- 
ted as  the  noise  level  in  this  altitude  region  is  considerably 
higher  than  the  signal  level. 

Inversions  have  also  been  performed  on  simulated  data 
including  bias  errors  in  the  tangent  height  determination.  It  is 
found  that  bias  errors  will  produce  a shift  in  altitude  scale  of 
the  inverted  profile  in  direct  proportion  to  the  bias  magnitude. 


V.  CONCLUDING  REMARKS 

This  paper  has  demonstrated  that  measurements  from  space- 
craft solar  extinction  experiment  can  be  inverted  to  produce 
aerosol  and  ozone  vertical  profiles  from  cloud  top  up  to  approxi- 
mately 50  km  altitude.  Analysis  of  the  inversion  results  from 
simulated  measurements  including  various  experimental  errors  indi- 
cated that  the  resolution  of  the  inverted  profiles  will  be 
degraded  as  the  errors  are  increased. 

Both  the  linear  constrained  inversion  method  and  the  itera- 
tive method  have  been  used  for  the  inversion  of  solar  extinction 
measurements.  The  accuracy  of  the  inverted  results  from  the  two 
different  inversion  methods  are  similar. 
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SYMBOLS 

coefficient  for  wavelength  dependence  of  Rayleigh  extinction 
model  parameter  for  aerosol  extinction  as  function  of  wave- 
length dependence 

coefficient  for  wavelength  dependence  of  ozone  absorption 

noise  level 

residue  vector 

optical  depth 

distance 

constrained  matrix 
identity  matrix 

monochromatic  radiant  intensity 
Jacobian  matrix 

monochromatic  source  radiant  intensity 
iteration  number 
normalized  weighting  function 
optical  path  length 

ratio  of  measured  optical  depth  to  computed  optical  depth 
solution  vector  for  nonlinear  equations 

model  parameter  for  aerosol  extinction  as  function  of  wave- 
length dependence 
extinction  coefficient 
smoothing  parameter 
wavelength 

control  parameter  for  iterative  solution  to  nonlinear 
equations 

monochromatic  transmittance 


522 


REFERENCES 


1.  S.  Chandrasekhar,  "Radiative  Transfer,”  p.  9.  Oxford 

University  Press,  New  York,  1950. 

2.  S.  Twomey,  On  the  nvmerical  solution  of  Fredholm  integral 

equations  of  the  first  kind  by  the  inversion  of  the  linear 
system  produced  by  quadrature,  J.  Assoc.  Comput.  Mach.  10, 
99  (1950) . 

3.  D.  L.  Phillips,  A technique  for  the  nimierical  solutions  of 

certain  integral  equations  of  the  first  kind,  J.  Assoc. 
Comput.  Mach.  9,  97  (1962). 

4.  S.  Twomey,  Comparison  of  constrained  linear  inversion  and  an 

iterative  nonlinear  algorithm  applied  to  the  indirect  esti- 
mation  of  particle  size  distributions,  J.  Comput.  Phys.  18, 
188  (1975) . 

5.  D.  W.  Marquardt,  An  algorithm  for  least-square  estimation 

of  nonlinear  parameters,  J.  Soc.  Ind.  Appl.  Math.  11,  431 
1963. 

6.  L.  Elterman,  Visible  and  IR  attenuation  for  altitudes  to 

50  km.  Air  Force  Cambridge  Research  Laboratory,  1968 
[ AFCRL-68-0153] . 

7.  M.  P.  McCormick  and  W.  H.  Fuller,  Lidar  measurements  of  two 

intense  stratospheric  dust  layers,  Appl.  Opt.  14,  4 (1975). 


523 


DISCUSSION 


Shettle:  On  one  of  your  first  figures  where  you  show  the  optical 

thicknesses,  you  included  nitrogen  dioxide,  which  would  add  a 
fifth  unknown.  I can't  recall  whether  that  was  a . . . 

Chu:  No,  that  was  not  in  the  simulation. 

Shettle:  The  question  is,  would  it  add  a significant  effect, 

particularly  when  you  start  trying  to  get  the  slope  of  the  aerosol 
extinction  versus  wavelength? 

Chu:  If  we  assume  we  don't  know  anything  about  nitrogen  dioxide, 

it  will  contribute  some  error  to  the  slope  on  the  aerosol  curve - 
It  should  be  pointed  out  that  the  peak  nitrogen  dioxide  concen- 
tration occurs  approximately  at  30  km;  whereas  the  aerosol  peak  is 
around  18  to  20  km.  There  is  a spatial  discrimination  as  well  as 
a spectral  discrimination  utilizing  this  technique. 

Shettle:  Yes,  except  you  are  not  looking  for  it  and  if  you  include 

it,  that  would  give  you  a fifth  unknown.  And  you  still  only  have 
four  wavelengths. 

Chu:  In  fact,  on  SAGE  we  do  have  five  channels. 

McCormick:  Yes,  very  recently  we  have  encountered  one  more  chan- 

nel with  lower  vertical  resolution.  Dr.  Chu's  paper  addresses  a 
four-channel  instrument  only. 

Pepin:  I would  like  to  comment  on  that  last  question.  The  fact 
that  the  NO2  contributes  only  a small  amount  means  that  one  can 
correct  for  it  quite  accurately  by  not  knowing  it  very  well  and 
subtract  it  out.  It's  a percent  of  a small  percent. 

Shettle:  The  slope  of  the  aerosol  extinction  is  still  sensitive 

to  the  N02^  because  it  is  proportional  to  the  change  in  the  extinc- 
tion with  wavelength,  so  extra  extinction  will  add  to  the  error. 

Pepin:  What  I am  saying  is  that  if  it  only  contributes  one  per-- 

cent  to  the  signal  and  if  you  only  know  it  to  50  percent,  then  it 
will  only  contribute, one-half  percent  to  the  inversion  problem  in 
te2nns  of  signal. 

Green:  Ben  Herman,  in  his  talk,  indicated  that  if  the  extinction 

due  to  aerosols  were  curved  upward, you  have  one  characteristic 
shape  of  size  distribution  and  if  it  were  to  curve  downward, 
another.  What  does  the  effect  of  assigning  a straight  line  have 
upon  your  inversion  procedure?  Is  it  then  strictly  a single  Junge 
power  law  or  b»oad  range  or  what? 
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Chu:  You  mean  trying  to  approximate  the  aerosol  by  a straight 

line? 

Green:  The  power  law,  the  Law? 

Chu:  Well,  all  we  can  get  is  two  unknown  parameters.  We  are 

limited  by  the  channel  number.  We  have  four  measurements  and  we 
are  looking  for  four  unknowns.  This  is  about  the  maximum  we  can 
do.  I don't  see  any  way  around  the  problem  besides  just  trying  to 
describe  the  aerosol  as  a two-parameter  model.  Of  course,  it  might 
not  exactly  reflect  what  nature  is  doing.  But  according  to  the 
paper  that  Tom  Swissler  presented  about  two  days  ago,  the  two- 
parameter  model  is  quite  adequate  in  describing  most  of  the  size 
distribution.  I mean  not  the  bimodal  type;  but  the  single  modal 
size  distribution.  Of  course,  there  is  some  sort  of  spread  in 
terms  of  measuremental  accuracies,  almost  quite  comparable  to  what 
we  are  seeing  here. 

Green:  Ben  had  quite  a dichotomy.  Am  I correct  in  reading  your 

paper,  that  when  you  suddenly  had  an  upward  curvature  you  lost  a 
peak?  And  when  you  had  a downward  curvature,  I guess  you  had  two 
humps? 

Herman:  When  it  was  curved  this  way,  we  had  the  slope  with  the 

hump  on  it — the  Junge  with  a hump  on  it.  When  it  was  curved  the 
other  way,  in  other  words,  high  in  the  middle  and  low  on  either 
end,  it  gave  kind  of  a log  normal.  Now  the  straightest  one  that 
we  have,  which  is  almost  straight  but  not  perfectly  straight,  gives 
not  quite  Junge,  a little  bit  of  deviation  from  Junge.  But  don't  for- 
get that  Junge  will  give  a straight  line  from  zero  to  infinity  in 
size.  Our  size  limits  are  finite  so  you  wouldn't  expect  it  to  be  a 
perfect  relationship. 

Pepin:  I think  that  one  has  to  think  of  the  fact  that  the  SAGE 

experiment  is  designed  to  look  at  the  stratosphere.  When  you  are 
looking  at  the  stratosphere,  then  you  have  to  ask  yourself  what 
kind  of  aerosols  are  present  in  the  stratosphere  in  terms  of  size 
distribution.  You  have  to  go  back  to  the  type  of  data  that  was 
presented  in  the  paper  by  Harris  and  Rosen  concerning  the  distri- 
bution of  sizes  that  have  been  measured  in  the  stratosphere;  those 
aerosols  look  very  different  than  those  presented  in  the  paper  by 
Ben  Herman,  which  were  tropospheric  aerosols.  Consequently,  the 
regime  that  the  SAGE  experiment  has  to  work  in  is  very  different. 

Fraser:  My  question  has  to  do  with  your  assumption  of  spherical 

symmetry.  With  respect  to  the  gases,  measurements  by  Lazarus  show 
that  you  can  get  a significant  change  in  the  gas  concentration  in 
the  distance  of  a few  hundred  kilometers.  Also,  one  of  your  primary 
purposes,  as  I understand  it,  in  this  experiment,  is  to  find  out 
something  about  volcanic  dust  put  into  the  stratosphere.  I would 
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think  that  there  would  be  significant  gradients  in  the  volcanic 
dust.  Have  you  examined  how  that  affects  your  data  reduction? 

Chu:  Well,  for  the  ozone  problem  we  have  realized  the  difficulty. 
We  haven't  looked  into  the  details  yet  about  the  horizontal  homo- 
geneity problem  and  how  they  are  going  to  affect  our  inversion 
results.  What  we  would  probably  end  up  with  is  sort  of  a decrease 
in  our  resolution  if  the  ozone  is  really  fine  structured.  As  far 
as  the  aerosols  are  concerned,  particularly  volcanic  dust,  some  of 
the  lidar  data  taken  over  Lcingley  recently  has  shown  that  we  had 
considerably  homogeneous  layering  structure  over  quite  an  extended 
region. 

Fraser:  1 thought  a few  days  ago  someone  showed  a slide  with  sig- 

nificant changes  in  one  day? 

Chu:  One  day,  yes,  but  over  a single  experiment  period  which  is 

of  the  order  of  one  minute  or  so,  we  are  looking  over  a . . . 

Fraser:  Yes,  but  you  are  looking  over  a horizontal  distance  effec- 

tively of  a few  hundred  kilometers. 

Chu:  That's  right.  A few  hundred  kilometers — we  haven't  really 

seen  any  large  gradients. 

McCormick:  That  is  correct.  For  example,  we  perfomed  lidar 

measurements  from  Kansas  City,  Missouri  during  July  1975. 
Simultaneously,  the  NCAR  lidar  in  Boulder,  Colorado,  about  600  miles 
away,  obtained  data.  A comparison  of  these  data  showed  very  little 
inhomogeneity  in  the  stratospheric  dust.  I think  you  are,  perhaps, 
recalling  some  of  the  earlier  papers  showing  data  taken  soon  after 
the  del  Fuego  volcanic  eruption.  Our  experience,  however,  is  that 
there  is  very  little  change  during  a data-taking  period  of  a few 
hours.  Even  soon  after  the  eruption,  we  experienced  significant 
changes  in  stratospheric  dust  only  over  a period  of  days,  not 
minutes.  I feel  the  assumption  of  horizontal  homogeneity  for  the 
stratospheric  aerosol  is,  therefore,  a good  one  under  most  atmos- 
pheric conditions . 

Shettle:  He  showed  a graph  of  the  transmissions.  If  anything,  the 

transmissions  were  smaller  than  the  ones  that  were  being  measured 
by  ground  base  measurements  because  you  are  dealing  with  long-slant, 
paths  that  are  showing  transmissions  in  a 50  to  80  percent  range. 

Dr ay son:  I've  got  a comment  on  this  idea  of  measuring  the  ozone 
and  then  going  backwards.  It  seems  that  here  you  are  really  trying 
to  have  your  cake  and  eat  it  too.  On  the  one  hand, you  are  assiming 
that  you  know  everything  about  the  atmospheric  aerosols  sufficiently 
well  to  find  the  ozone  profile.  And  then  you  are  turning  around 
and  saying,  I can  put  the  ozone  in  and  get  another  parameter  about 
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the  atmospheric  aerosols.  If  you  don't  know  the  parameters  of  the 
atmospheric  aerosols  well  enough  in  the  first  place,  you  can't  do 
the  ozone  experiment.  We  have  done  similar  types  of  calculations 
for  the  infrared  region  of  the  spectrum  and  with  lots  of  potential 
sources  of  error.  The  geometry  is  the  same  and  much  of  the  ideas 
are  the  same  in  here,  and  we  find  that  the  accuracy  you  get  is  not 
nearly  as  good  as  the  results  in  today's  paper. 

Herman:  I think  all  this  has  been  looked  at.  There  are  four  wave- 

lengths in  this  experiment.  And,  they  can  be  solved  simultaneously 
for  four  pieces  of  information.  You  caui  get  the  transmission  at 
two  wavelengths;  you  can  get  the  ozone  amount  and  you  can  get  the 
Rayleigh  scatter  or  any  other  combination.  Anything  beyond  that, 
you  have  to  make  assumptions. 

Dray son:  No,  I don't  think  so.  If  you  have  done  that  you  haven't 

got  the  ozone  amount.  You've  got  a parameter,  but  it  isn't  the 
ozone  amount,  unless  you  can  prove  definitely  that  those  other  two 
parameters  completely  describe  the  aerosols  and  the  ozone  amount 
comes  straight  out  of  it  without  any  further  information  about  the 
aerosols. 

Herman:  There  are  four  simultaneous  equations  and  one  can  solve 

for  four  pieces  of  information. 

Dray son:  If  the  atmosphere  is  described  by  four  pieces  of  infor- 

mation, then  you've  got  it.  But,  if  it  is  really  described  by 
five  pieces  of  information,  then  you  haven't  got  your  ozone  pro- 
file but  some  combination  of  aerosol  and  ozone. 

Herman:  My  answer  to  your  statement  was  that  you  can  get  four 

pieces  of  data  out  of  this  experiment.  Any  four  pieces  you  want. 

If  you  need  any  more  them  that,  then  you  have  to  make  assumptions. 

Dray son:  The  question  is,  do  you  need  more?  Can  you  describe  the 

atmosphere  in  these  four  parameters?  If  not,  what  you  have  pre- 
sented as  ozone  data  is  not  really  ozone  data,  but  is  a combination 
of  that  plus  aerosol  parameters. 

Herman:  You  can  get  ozone  data  if  you  make  proper  assumptions — 

enough  assximptions  to  get  the  aerosol  data  out  of  it  and  vice- 
versa. 

Dray son:  But  are  they  correct  assumptions? 
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INVERSION  OF  SOLAR  EXTINCTION  DATA  FROM  THE 


APOLLO*-SOYUZ  TEST  PROJECT  STRATOSPHERIC 
AEROSOL  MEASUREMENT  (ASTP/SAM) 
EXPERIMENT 


Theodore  J.  Pepin 
University  of  Wyoming 


The  ASTP/SAM  Experiment  was  flown  to  demonstrate  that 
direct  solar  occultation  measurements  by  photometers  and  from 
photographs  can  be  used  for  defining  stratospheric  aerosols. 
This  paper  contains  a description  of  the  inversion  methods 
that  have  been  used  to  determine  the  vertical  profile  of  the 
extinction  coefficient  due  to  the  stratospheric  aerosols 
from  the  data  measured  during  the  ASTP/SAM  solar  occultation 
experiment.  Inversion  methods  include  the  "onion  skin  peel 
technique"  and  methods  of  solving  the  Fredholm  equation  for 
the  problem  subject  to  smoothing  constraints.  The  latter  of 
these  approaches  involves  a double  inversion  scheme  that  has 
been  employed. 

Comparisons  are  made  between  the  inverted  results  from 
the  SAM  experiment  and  near  simultaneous  measurements  made 
by  lidar  and  balloon  born  dustsonde.  The  results  are  used  to 
demonstrate  the  assumptions  required  to  perform  the  inver- 
sions for  aerosols. 


I . INTRODUCTION 

This  experiment  was  designed  , to  demonstrate  the  feasibility 
of  remotely  sensing  aerosols  in  the  stratosphere  from  a low- 
orbiting  manned  spacecraft.  Increasing  interest  in  the  strato- 
sphere has  led  to  the  investigation  of  methods  for  remote  sensing 
from  Earth-orbiting  satellites  (Ref.  1) . Information  gained  from 
the  Stratospheric  Aerosol  Measurement  Experiment  performed  during 
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the  Apollo-Soyuz  Test  Project  mission  will  be  used  in  the  design 
of  remote-sensing  ecjuipment  for  future  satellite  missions. 

The  instrument  used  for  making  these  stratospheric  aerosol 
measurements  consisted  of  a photometer  and  associated  electronics 
that  provided  a signal  to  the  command  module  telemetry  system.  A 
Hasselblad  data  camera  (HDC)  equipped  with  special  infrared  (IR) 
film  and  filter  was  used  to  photograph  the  sunset  and  sunrise 
events.  The  experiment  technique  involved  directly  measuring  solar 
intensity  (photometer)  and  sun  shape  (photographs)  in  the  spectral 
region  centered  at  approximately  0.84  ym.  Immediately  before  the 
satellite  night,  as  the  spacecraft  approached  the  shadow  of  the 
Earth,  the  line  of  sight  to  the  Sun  passed  first  through  the  upper 
layers  of  the  atmosphere  and  then  steadily  down  into  the  lower 
layers  of  the  troposphere.  During  the  1.5  minutes  (approximately) 
required  for  the  instrimient  line  of  sight  to  pass  through  the  lower 
150  km  of  the  atmosphere,  the  solar  intensity  was  recorded  by  the 
photometer,  and  solar  disk  shape  changes  were  photographed.  The 
same  set  of  measurements  was  made  at  satellite  dawn  as  the  space- 
craft emerged  from  the  shadow  of  the  Earth. 

The  total  extinction  coefficient  was  obtained  from  the  varia- 
tion of  the  solar  intensity  as  a function  of  total  airmass  dis- 
tributed along  the  line  of  sight.  At  the  effective  wavelength  of 
the  photometer  and  the  photographic  system,  the  extinction  was 
principally  produced  by  the  atmospheric  aerosols;  the  measurements 
are  being  used  to  determine  the  aerosol  concentration. 

To  verify  the  performance  of  the  SAM  Experiment,  ground- truth 
data  were  acquired  by  dustsonde  (a  balloon-borne  aerosol  optical 
counter)  and  a lidar  system  (ground-based  laser  radar) . The  dust- 
sonde was  flown  from  the  Richard s-Gebaur  Air  Force  base  (latitude 
(lat.)  38-8^N,  longitude  (long.)  94.7^W)  near  Kansas  City,  Missouri, 
at  the  same  time  and  place  of  the  ASTP  second  sunset  SAM.  The 
lidar  measurements  were  also  made  from  Richards-Gebaur  Air  Force 
Base  on  Earth  nights  bracketing  and  during  the  second  sunset  SAM. 
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II.  DESCRIPTION  OF  INSTRUMENTATION 


A.  The  SAM  Instrument 

The  photometer  (Fig.  1)  used  for  the  SAM  Experiment  utilizes 
a pin  diode  detector  having  a 10^  field  of  view,  and  looks  at  the 
Sun  through  the  command  module  window  at  a wavelength  centered  at 
approximately  0.84  ym.  The  detector  was  used  in  the  photovoltaic 
mode  to  detect  the  solar  signal,  and  samples  were  taken  at  a rate 
of  10  samples/second  using  a 12-bit  analog-to-digital  converter. 
The  signal  output  of  the  converter  was  recorded  and  transmitted  by 
the  command  module  data  system. 


Fig.  I.  The  photometer  used  for  making  stratospheric  aerosol 
measuremen ts . 
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B.  Camera  System 


The  photographic  portion  of  the  experiment  consisted  of  a 
series  of  timed  IR  photographs  of  spacecraft  sunrises  and  sunsets 
taken  through  the  command  module  window  with  a HDC.  The  250-mm- 
focal-length  (12.5^  by  12.5^  field  of  view  (FOV) ) , f/5.6  lens  was 
fitted  with  a quality  glass  IR  filter  capable  of  obtaining  at  least 
three  orders  of  magnitude  blocking  in  the  visible  and  the  ultra- 
violet. Two  50-frame  Kodak  multispectral  IR  aerial  films  (ESTAR- 
AII  base)  SO-289  were  used  in  photography.  An  intervalometer 
automated  the  advance  of  each  frame  every  2.5  seconds  during  the 
experiment. 

C.  Balloonborne  "Dust sonde" 

The  University  of  Wyoming  balloonborne  dustsonde  was  used  in 
this  program  for  the  ground- truth  in  situ  aerosol  measurement  that 
was  made  from  the  Richards-Gebaur  Air  Force  Base  (Ref.  2) . Air 
was  sampled  during  balloon  ascent  and  parachute  descent  with  a 
2.5  liter  light-scattering  sizing  counter,  into  the  test  volume  of 
which  a well-defined  stream  of  air  is  pumped  at  approximately  0.75 
liter/minute.  Individual  particles  scatter  light  into  the  micro- 
scopes. The  light  pulses  are  amplified  and  by  using  pulse-height 
discrimination,  the  integral  concentration  of  aerosol  particles 
larger  than  0.3  and  0.5  urn  in  diameter  can  be  determined. 

The  background  noise  for  the  system,  mainly  due  to  Rayleigh 
scattering  from  the  air  molecules  in  the  chamber  at  low  altitudes 
and  from  cosmic-ray  scintillation  in  the  photomultiplier  glass  at 
high  altitudes,  was  measured  approximately  every  15  minutes  during 
the  flight  by  passing  filtered  air  through  the  chamber.  The  back- 
ground was  negligible  above  a 10-km  altitude;  and  it  was  corrected 
for  below  this  altitude.  The  dustsonde  was  also  equipped  with  a 
rawinsonde  temperature  element  for  recording  the  atmospheric 
temperature  profile. 
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D.  Lidar 


The  lidar  measurements  for  the  SAM  experiment  were  provided 
by  the  NASA  Lcingley  Research  Center  (LaRC)  122-cm  (48-in.)  laser 
radar  system,  which  consists  of  two  temperature-controlled  lasers 
(ruby  and  neodymium-doped  glass)  mounted  on  either  side  of  an  f/10 
Cassegrainian-configured  telescope  composed  of  a 122-cm  (48  in.) 
diamter,  f/2,  all-metal  primary  and  a 25.4-cm  (10  in.)  diameter 
secondary  (Ref.  3) . The  output  from  the  detector  package  is 
recorded  by  a high-speed  data  acquisition  system.  Analog  signals 
are  amplified  and  bandwidth  limited,  digitized  at  a 5-  or  10-MHz 
rate  with  8-bit  accuracy,  and  recorded  on  magnetic  tape.  A 16-bit- 
word  storage  computer  is  used  to  control  the  data  acquisition 
system  and  to  process  the  data.  An  X-band  microwave  radar,  coinci- 
dent with  the  laser  system  axis,  is  used  to  ensure  safe  operation 
in  the  atmosphere.  A rotating  shutter  reduces  laser  fluorescence 
after  Q switching. 


III.  RESULTS  OF  OBSERVATIONS 
A.  SAM  Photometry 

The  SAM  Experiment  was  designed  to  observe  four  events  on 
July  22,  1975.  During  Apollo  revolution  95,  measurements  were  made 
of  a sunset  observation  (00:07:04  Universal  Time  (UT) ) off  the  coast 
of  New  Jersey  (lat.  39^10*  N,  long.  72^45*  W)  and  of  a sunrise 
observation  (00:45:52  UT)  over  the  Indian  Ocean  off  the  coast  of 
Australia  (lat.  43*  S,  long.  99^55*  E) . During  Apollo  revolution 
96,  a sunset  observation  (01:37:52  UT)  was  taken  near  Kansas  City, 
Missouri  (lat.  38^57’  N,  long.  95^06*  W)  , followed  by  a sunrise 
observation  (02:14:39  UT)  over  the.  Indicin  Ocean  (lat.  42^55*  S, 
long.  77^39'  E) . The  SAM  photometer  was  used  to  obtain  radiometric 
measurements  during  each  of  these  four  events,  and  the  measured 
photometric  intensities  have  been  inverted  by  using  the  inversion 
procedures  outlined  below  to  obtain  total  extinction  as*  a function 
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of  altitude.  Figure  2 shows  the  sub  satellite  tracks  for  the  95th 
and  96th  revolutions  and  the  locations  of  the  tangent  points  of 
the  0-km  altitude  grazing  ray  of  the  Sun  for  the  four  events. 

B . Photography 

Photographs  were  made  of  the  first  sunrise  and  the  second  sun- 
set with  the  HDC  . Figure  3 is  a composite  of  frames  AST-28-2400 
to  AST-28-2406  taken  during  the  first  sunrise.  These  photographs 
have  been  printed  with  high  contrast  to  show  the  observed  refracted 
images  of  the  solar  disk  and  have  been  superimposed  on  a grid 
showing  the  horizon  and  the  tangent  altitudes.  Figure  4 shows 
frame  AST-28-2402,  an  "isodensity  tracing"  of  this  frame,  and  an 
outline  of  the  theoretical  computed  shape  of  the  Sun  as  expected 
for  the  orbital  and  atmospheric  conditions  present-  The  distri- 
bution of  the  observed  isophotes  is  due  to  limb  darkening,  extinc- 
tion by  atmospheric  constituents,  and  the  effects  of  refraction. 
These  photographs  confirm,  at  spacecraft  altitude,  that  effects 
due  to  refraction  must  be  considered  in  the  design  of  solar 


Fig.  2.  Sub-satellite  tracks  for  SAM  observations » 
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sec 

Fig.  3.  Composite  of  photographs  taken  during  first  sunrise 
event  (AST-28-2400  to  2406) . 


Fig.  4.  Sun-shape  comparison.  (a)  High-contrast  sun  photo- 
graph (AST-28-2402) ; (b)  Isodensitg  tracing;  and  (c)  Expected 
refracted  sun  shape. 
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occultation  experiments.  The  photographs  taken  during  the  SAM 
events  are  currently  being  analyzed  to  siibstantiate  theoretical 
models  that  are  under  development  for  use  in  future  occultation 
experiments. 


C.  Ground-Truth  Data 

Data  from  the  dustsonde  balloon  flight  launched  on  July  21, 
1975,  at  23:58  UT  are  shown  in  Figs,  5 to  7.  Figure  5 contains 
the  measured  temperature  profile.  Fig.  6 illustrates  the  measured 
aerosol  concentration  as  a function  of  altitude  for  particles 
larger  than  0.3  ym  in  diameter,  and  Fig. . 7 shows  the  aerosol  count 
ratio  (ratio  of  particles  greater  than  0.3  ym  in  diameter  to  par- 
ticles greater  than  0.5  ym  in  diameter)  as  a function  of  altitude. 


Fig.  5.  Measured  temperature  profile  above  Kansas  City, 
Missouri . 
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Fig.  6.  Measured  dustsonde  aerosol  concentration  (>0,3  urn 
diameter)  above  Kansas  City,  Missouri . Curved  lines  indicate 
mixing  ratio. 

The  LaRC  lidar  system  was  used  during  the  nights  of  July  22 
and  July  23  to  obtain  laser  backscatter  measurements  of  the  strato- 
spheric aerosols.  Figrure  8 shows  the  backscatter  ratios  obtained 
during  the  measurements  made  on  July  23  at  07:30  UT,  and  Fig.  9 
shows  the  measured  backscatter  ratios  made  on  July  22  at  04:51  UT. 
(Backscatter  ratio  is  the  ratio  of  total  observed  backscatter  to 
molecular  backscatter.) 
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Fig.  7.  Ratio  of  measured  aerosol  count  for  particles  >0.3  ym 
diameter  to  particles  >0.5  \xm  diameter  above  Kansas  City,  Missouri. 

IV.  METHODS  .OF  INVERSION. 

Two  methods  have  been  employed  for  the  inversion  of  the  SAM 
photometric  measurements  in  order  to  obtain  stratospheric  aerosol 
extinction.  Both  of  the  methods  assume  a model  of  the  stratosphere 
that  contains  horizontal  homogeneity.  The  first  method  has  been 
referred  to  as  the  onion  skin  peel  inversion  and  allows  for  the 
analytical  inversion  of  the  results.  The  second  method  involves  a 
technique  of  double  inversion. 

A.  Onion  Skin  Peel  Inversion 

For  this  method  of  inversion,  the  atmosphere  is  divided  into 
a number  of  homogeneous,  spherical  layers  that  are  considered  to 
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have  uniform  aerosol  concentrations.  Figure  10  shows  a cross 
section  of  the  measurement  geometry  and  illustrates  the  path  lengths 
of  solar  rays  at  different  atmospheric  layers.  Figure  11  shows 
the  inversion  geometry  and  illustrates  the  solar  intensity  contri- 
bution transmitted  through  the  different  layers  as  seen  from  the 
ASTP  instrument  during  a sunset  event. 


As  the  bottom  limb  of  the  Sun  becomes  tangent  to  each  of  the 
layers  of  the  onion  skin  atmospheric  model,  the  intensity  can  be 
computed  from  limb  darkening  and  refraction  by  summing  the  intensi- 
ties over  the  layers  above.  For  example,  following  the  notation 
for  the  layers  identified  in  Figs.  10  and  11,  one  finds  that  when 
the  lower  limb  is  tangent  to  the  first  and  second  layers,  the 
observed  solar  fluxes  are  given  by 


= ^10  + ^11 


^2  = "20  ^ "21  ^ "22 


(1) 


0 

where,  in  general,  F^  and  F2  are  the  observed  fluxes  and  and 
@ 

are  the  intensities  above  the  upper  layer  of  the  inversion 
model  (above  the  point  at  which  extinction  is  observed) . The 
intensities  ^21'  ^22  transmitted  intensities  to 

be  determined  by  inversion.  By  using  the  Lambert-Beer  law,  Eq.  (1) 
can  be  written  in  the  form 


Fig.  10.  Cross  section  of  the  onion  skin  atmospheric  model 
-showing  the  ray  path  lengths^  where  is  the  path  length  in  the 
jth  layer  for  the  solar  ray  tangent  to  the  ith  layer. 
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Fig.  11.  Inversion  geometry  showing  the  contributions  to  the 
total  transmitted  Sun  intensity , where  is  the  observed  intensity 
for  the  nth  layer,  In^  ^21'  ^22  transmitted  intensi- 
ties to  be  determined  by  inversion,  and  is  the  portion  of  the 

unattenuated  solar  intensity  that  contributes  to 


^10  ^1*1 

^2  ^20  ^21  ^xp(-32Pj2^  ^2*2  ” ^2^22^ 

0 

where  is  the  portion  of  the  unattenuated  solar  intensity 

through  the  ith  layer  that  contributes  to  the  observed  flux  F ; 

n 

is  the  path  length  in  the  jth  layer  for  the  solar  ray  tangent 
to  the  ith  layer;  and  3^  is  the  total  extinction  at  the  wavelength 
of  the  SAM  system  in  the  ith  layer.  Equation  (2)  yields 

(3) 
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This  process  can  be  continued  as  the  sunset  event  occurs; 
thus, allowing  for  the  determination  of  the  vertical  profile  of  the 
total  extinction.  Figures  12  and  13  show  the  results  of  the  onion 
skin  inversions  for  the  ASTP  sunrise  and  sunset  events. 


B.  Technique  of  Double  Inversion 

Using  the  measurements  of  the  solar  flux  from  the  full  solar 
disk,  during  a sunrise  or  sunset  event,  the  vertical  profile  of 
the  aerosol  volxime  extinction  coefficient  can  be  determined  by  a 
series  of  solutions  of  two  integral  equations.  Each  of  the  inte- 
gral equations  can  be  transformed  into  integral  equations  of  the 
first  kind  and,  therefore,  lend  themselves  to  solution  by  standard 
inversion  methods. 


The  first  of  these  equations  is 


F(t) 


0 

I (t,  X, 


y)T(x)  cos  dw 


(4) 


Fig.  12.  Results  of  onion  skin  inversions  for  3 TOTAL  *for 
sunrise  events. 
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Fig.  13.  Results  of  onion  skin  inversions  for  ^ 
sunset  events. 


where  F(t)  is  the  time -re solved  measured  flux,  w is  the  detector 

field  of  view  and  x and  y are  orthogonal  coordinates  in  the  field 

of  view.  The  y axis  has  been  taken  tangential  to  the  Earth's 

surface*  The  intensity  distribution  of  the  solar  disk  is  given 
0 

by  I (t,  X,  y) , transmission  through  the  Earth's  atmosphere  by 

0 

T(x) , and  the  angle  between  I (t,  x,  y)  and  the  detectors  normal 
by  . 

Equation  (4)  can  be  reduced  to 


F(t)  = 


m 0 

I (t,  x)T(x)dx 


(5) 


X 


0 0 
by  integrating  I (t,  x,  y)  over  y to  find  the  function  I (t,  x) . 

Equation  (5)  is  recognized  as  an  integral  equation  of  the 
first  kind  and  can  be  written  in  quadrature  form  as 


g(t)  = A(t,  x)f(x) 


(6) 
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where 


g(t)  = F(t) 

A(t,  x)  = W(x)  I^(t,  x) 
f (x)  = T(x) 


(7) 


Equation  (6)  can  be  solved  by  the  well-known  constrained 
linear  inversion  technique  developed  by  Phillips  (Ref.  4)  and 
Twomey  (Ref.  5) 

f = (a\  + Y DH)~^  A^’g  (8) 

The  second  integral  equation  involves  the  volume  extinction 
coefficient  B(Z)  for  the  altitude  Z and  can  be  found  by  con- 
sidering the  transmission  along  the  path  P which  is  given  by  Beer's 
law 

T(x)  = 


It  involves  the  evaluation  of  the  line  integral  and  can  be  put  in 
the  form  of  an  integral  equation  of  the  first  kind  as 


In 


/ ■> 

1 

It(x)J 

rz 


m 


3(Z) 


dP 

dZ 


(t,  Z)dZ 


(10) 


Equation  (10)  can  be  put  in  the  form  of  Eq.  (6)  by  making  the  sub- 
stitution 

(^fej  I 

A(t,  Z)  = w(Z)P(t,  Z)  I (11) 

f(Z)  = 3(Z)  3 

and  can  be  solved  for  3(Z)  by  the  same  method  as  the  first  of  the 

equations  of  the  inversion. 

The  linear  inversion  employed  to  obtain  the  solutions  for  both 
the  transmission  cuid  the  extinction  coefficient  made  use  of 
smoothing  constraints  (H  being  the  smoothing  matrix)  . That  is , 
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the  solution  was  constrained  so  that  the  second  derivitive  of  the 
function  was  minimized.  In  addition  to  this  constraint,  the 
diagonal  matrix  D allowed  for  smoothing  to  be  applied  to  the 
solution  as  a function  of  altitude.  The  elements  of  the  D matrix 
were  calculated  from  the  measured  fluxes  for  the  first  inversion 
and  from  the  transmissions  for  the  second  inversion.  They  reflect 
the  percent  error  in  each  measior ement . The  larger  the  measured 
fluxes  the  smaller  the  corresponding  extinction  coefficients.  The 
elements  of  the  matrix  apply  smoothing  differentially  with 
altitude. 

Figure  14  shows  the  inversions  for  obtained  from 

AEROSOL 

two  sunset  events.  The  molecular  extinction  coefficient  3 (2)  was 

m 

calculated  from  Rayleigh  scattering  theory  for  the  effective  wave- 
length of  the  SAM  system.  For  the  Sunset  II  observation,  the 

balloon  measured  temperature  profile  was  used  to  determine  3 (Z) . 

m 


Fig.  14. 
set  events. 


Results  of  double  inversion  for 


e 


AEROSOL 


for  sun- 
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For  the  sunset  I observation,  a standard  atmosphere  was  used. 

3.  , was  then  determined  by  subtracting  the  molecular  contri- 

Aerosol 

bution  from  the  inverted  results. 


3 


(z) 

Aerosol 


= 3(2)  - 3„(2) 

Itl 


V.  COMPARISON  OF  RESULTS 

The  balloon  dustsonde  measurements  of  number  density  of  par- 
ticles greater  than  0.3  ym  diameter  and  the  aerosol  count  ratio 
have  been  used  as  a function  of  altitude  to  fit  various  size  dis- 
tributions. These  size  distributions  have  been  used  to  predict 
the  SAM  photometer  measurements  of  the  aerosol  extinction  observed 
during  the  second  sunset  event  and  the  lidar  observations  of  the 
backscatter  ratio.  Figure  15  shows  a comparison  between  a plot  of 
the  SAM  inversion  results  obtained  for  the  second  sunset  event 
us^ng  the  onion  skin  peel  inversion  method  and  the  Mie-scattering 
computations  performed  using  the  log-normal  aerosol  size  distri- 
bution developed  by  Pinnick  et  al.  (Ref.  6).  Figure  16  compares 
the  results  of  the  double  inversion  technique  with  the  Mie- 
scattering  computations.  This  distribution  was  adjusted  as  a 
function  of  altitude  so  that  the  total  number  and  mode  radius  fit 
the  dustsonde  observations.  For  these  calculations,  the  index  of 
refraction  for  the  stratospheric  aerosol  was  taken  to  be  1.43  - 
i(o\o),  1.50  - i(O.O),  and  1.60  - i(O.O)  (where  i = / -1)  . 

Figure  17  shows  the  comparison  between  the  lidar  backscatter 
ratios  observed  on  the  night  of  July  22  and  the  computed  back- 
scatter ratios  for  index  of  refraction  1.43  - i(O.O),  1.50  - i(O.O), 
and  1.60  - i(O.O).  These  backscatter  ratios  were  computed  by  using 
the  same  size  distribution  fits  to  the  dustsonde  observations 
that  were  used  for  the  previously  mentioned  SAM  extinction  dust- 
sonde comparisons.  The  large  enhancement  at  approximately  13  km 
in  the  lidar  data  is  due  to  cirrus  clouds. 
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Fig.  15.  Comparison  of  SAM  results  from  onion  skin  inversion 
on  July  22,  1975  (seconJ.  sunset  event)  , and  Mie  computations  of 
dustsonde  results  on  July  22,  1975,  for  different  optical  proper- 
ties of  the  aerosols  _ = aerosol  extinction) . 

AEROSOL 


Results  of  the  three  different  techniques  agree  in  the  place- 
ment of  the  peak  altitude  of  aerosol  concentration,  and  they  are 
consistent  with  a single  size  distribution  and  particle  refractive 
index.  The  author  would  like  to  point  out  that  the  log-normal 
distribution  used  to  fit  these  observations  is  not  necessarily 
unique.  Other  distributions  might  fit  as  well. 
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Fig.  16.  Comparison  of  SAM  results  from  double  inversion  on 

July  22 f 1975  (second  sunset  event)  , and  Mie  computations  of  dust- 

sonde  results  on  July  22,  1975,  for  different  optical  properties 

of  the  aerosols  “ aerosol  extinction). 

AER  OSOij 
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Fig.  17.  Comparison  of  lidar  results  of  July  22,  1975,  and 
Mie  computations  of  dustsonde  results  on  July  22,  1975,  for  dif- 
ferent optical  properties  of  the  aerosols. 


SYMBOLS 


D diagonal  matrix 

flux  in  nth  layer 
H smoothing  matrix 

intensity  in  nth  layer 

0 

portion  of  the  unattenuated  solar  intensity  through  the 
ith  layer  that  contributes  to  the  observed  flux  F 

n 

n layer  number 

P path  length 

path  length  in  the  jth  layer  for  ray  tangent  to  the  ith 

layer 

t time  resolved  measured  flux 

transmittance  in  the  ith  layer 
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w 


detector  field  of  view 
X orthogonal  coordinate  in  field  of  view 

y orthogonal  coordinate  in  field  of  view 

Z altitude 

g(Z)  total  volme  extinction  coefficient  3oti  ^ at 

altitude  Z 

3,  , (Z)  aerosol  volume  extinction  coefficient  km  ^ at 

Aerosol 

altitude  Z 

3 (Z)  molecular  volimie  extinction  coefficient  km  ^ at 

m 

altitude  Z 

3 total  extinction  at  wavelength  of  SAM  system  in 

n 

nth  layer 

(p  intensity  distribution  of  solar  disk 

0 angle  between  I (t,  x,  y)  and  detectors  normal 

0 superscript  denoting  Sun 

f 
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DISCUSSIONS 


• T 


Shettle:  On  your  deviation  at  the  upper  altitudes,  one  possibility 

I think  there  could  be  some  additional  large  particles.  Some  of 
the  curves  that  Frariklin  Harris  showed  indicated  size  distributions 
that  are  suggestive  of  a kink  with  a somewhat  less  deep  slope.  You 
are  measuring  particles  .15  and  .25  pm  there  and  when  you  get  out  to 
one  micrometer , the  number  densities  are  sufficiently  low  that  they 
don’t  show  up  at  all  in  your  data. 

Pepin:  As  you  will  notice,^  the  size  ratio  is  getting  larger  at 

high  altitudes.  The  size  ratio  is  the  number  of  small  particles 
to  the  number  of  large  particles.  And,  consequently,  the  balloon 
data  indicates  that  the  distribution  is  getting  steeper  rather 
than  flatter  at  the  high  altitudes. 

Shettle:  Yes.  I am  suggesting  a second  component  which  mainly 

contributes  out  in  one  micrometer  region. 

Pepin:  That  would  be  very  interesting  if  there  is  a second  size 

distribution  at  high  altitudes. 

Shettle:  It  is  suggestive  of  meteoric  dust  coming  down.  I know 

I talked  about  it  earlier  this  week-  Another  possibility  that  the 
size  of  the  solar  disk  is  getting  larger  that  you  see.  What  about 
possible  uncertainty  in  the  intensity  across  the  solar  disk? 

Pepin:  The  brightness  theorem  has  to  be  satisfied  and  the  refrac- 

tion effects  are  getting  very  small  at  those  altitudes. 

Consequently,  the  errors  that  one  would  make  in  making  an  estimate 
are  going  to  zero  very  quickly  at  those  altitudes.  It  is  low  in 
the  atmosphere  where  the  refractive  effects  are  large-  x 

Park:  How  much  do  your  inversion  results  depend  on  the  pressure 

profile  you  adopt  in  the  retrieval? 

Pepin:  It  is  independent  of  pressure.  All  I have  to  do  is  to 

know  roughly  where  the  ray  goes  through  the  atmosphere. 

Park:  But  you  have  to  know  the  pressure  profile. 

Pepin:  To  first  order,  then  I can  determine  the  entire  atmosphere 

independent  of  the  geometrical  properties  of  the  atmosphere, 
because  the  spacecraft  scans  with  a uniform  rate  across  the  atmos- 
phere- It  is.  only  the  second  order  coupling  with  the  refracting 
geometry  that  produces  the  errors  in  the  experiment. 
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Fraser:  In  your  first  figure,  I think  you  showed  the  refraction 

effects  of  the  sunlight.  It  looked  to  me  as  though  when  the  sun 
got  closer  to  the  horizon,  the  horizontal  dimensions  decreased 
significantly.  Why  was  that? 

2 

Pepin:  (Showing  graph  ) . I think  maybe  that  was  an  optical 

illusion,  because  you  can  very  carefully  compare  the  horizontal 
distances  across  each  of  these  images. 

Fraser:  Are  they  the  same? 

Pepin:  Yes,  they  are  the  same. 

Green:  Aren't  stratospheric  aerosols  predominantly  smaller  than 

tropospheric  aerosols? 

Pepin:  Yes,  I think  they  are  very  often.  The  peak  seems  to  be 

at  small  sizes  for  stratospheric  aerosols.  Whereas  Ben  showed 
with  his  inversions  very  often  there  is  a second  peak  for  the 
tropospheric  aerosols — ^move  out  to  something  of  the  order  of  half 
to  one  micron  in  size. 

Green:  So  might  you  not  be  in  a regime  where  the  smaller  particles 

would  best  be  described  by  1/A^,  like  Rayleigh  particles,  so  that 
perhaps  a better  parameterization  would  be  something  which  inter- 
polates 1/X  to  1/X^? 

Twitty:  I think  we  have  seen  a lot  of  evidence  in  the  last  hour 

that  indicates  that  the  stratospheric  aerosols  variation  of  its 
scattering  coefficient  with  eavelength  is  more  like  1/X  , not  1/X^. 

Green:  If  you  have  lots  of  small  particles,  you  would  be  analogous 

to  the  situation  which  Ben  was  indicating.  Perhaps  a better  two- 
parameter  description  would  be — not  a*  single  power  law — but  one 
that  interpolates  between  two  powers.  You  could  perhaps  still 
find  a two-parameter  description  that  would  do  that,  but.  . . 

Twitty:  Well,  you  could  parameterize  it  that  way.  The  argument 

earlier  was  that  you  cannot  possibly  get  any  information  about  the 
two  parameters  out.  The  question  of  what  you  are  putting  into 
your  model  that  you  think  is  the  closest  representation  is  open  to 
your  own  decision.  I think  you  talked  about  nonpower  laws  in 
your  simulated  analysis;  you  are  using  log  normal  laws. 
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Pepin:  Yes,  in  fact,  I think  you  can  just  look  at  this  measurement 

that  was  taken  in  Kansas  City.  It  is  typical  of  aerosols  in  the 
stratosphere.  You  will  notice,  in  the  stratosphere,  that  between 
the  0.3  and  0.5-ym  diameter  size  particles,  there  is  a factor  of 
like  4 or  5 in  terns  of  the  number  of  particles  larger  than  that 
size.  If  you  go  to  0-01,  the  number  is  ten-  So  what  this  is 
saying  is  that  the  distribution  has  peaked  around  the  0-1  to  0.2 
micrometer  region. 
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EFFECTIVE  AEROSOL  OPTICAL  PARAMETERS  FROM 


POLARIMETER  MEASUREMENTS 


Jacob  G-  Kuriyan 
University  of  California 


In  this  paper,  the  theory  underlying  the  interpretation 
of  polarimeter  measurements  is  described.  The  assumptions  of 
the  model  are  carefully  stated  so  that  the  results  obtained 
from  the  ground-based  experiment  can  be  understood  without 
ambiguity , The  meteorological  significance  of  the  parameters 
is  also  deduced  in  the  paper.  With  a satellite-borne  polari- 
meter that  monitors  the  upwelling  radiation  field, the  effect 
of  the  ground  must  be  taken  into  account  in  order  to  obtain 
the  aerosol  parameters.  Two  methods  that  hold  promise  are 
described. 


1,  INTRODUCTION 

A great  deal  of  attention  has  centered  on  the  impact  of  arti- 
ficial and  natural  pollutants  on  man.  Of  particular  interest  to 
meteorologists  is  their  effect  on  the  heat  budget  of  the  atmosphere 
so  as  to  estimate  the  modifications  they  induce  on  weather  and 
climate.  This  paper  shall  address  itself  to  the  deteinciination  of 
the  optical  properties  of  atmospheric  particulates  that  are  of 
significance  to  meteorologists. 

The  atmosphere  is  often  likened  to  an  engine  driven  by  the 
solar  energy  that  is  absorbed  by  the  Earth- atmosphere  system.  In 
numerical  studies  of  the  circulation  of  the  atmosphere,  the  heat 
budget  of  the  atmosphere  is  a significant  input  to  the  problem. 

As  the  general  circulation  models  have  become  more  sophisticated, 
it  has  become  necessary  to  take  into  account  the  radiative  effects 
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of  aerosols/  vg^hich  can  be  viewed  as  a perturbative  correction  to 
the  other  terms  in  the  heat  budget. 

The  aerosol  effects  are  but  a minor  (yet  significant,  in  some 
cases)  correction  to  the  total  heat  budget  calculation,  and  so, 
unless  the  scheme  that  is  devised  to  incorporate  these  effects  is 
simple,  meteorologists  will  find  it  easy  to  continue  to  ignore 
their  effects.  Theoretical  techniques  are  available  for  calcu- 
lation of  radiative  properties  of  aerosols  of  pathological  shapes 
and  distributions,  but  we  must  resist  the  temptation  of  using  these 
models  unless  irrefutable  evidence  is  presented  that  simpler  models 
are  insufficient.  The  needs  of  the  user  (in  the  present  or  fore- 
seeable future)  dictates  that  simple  models  be  used  and  then 
empirical  formulae  (involving  the  aerosol  model  parameters)  for  the 
radiative  effects  be  derived  so  as  to  be  used  in  the  circulation 
models.  We  may  compare  this  to  a method  sometimes  employed  in 
French  cuisine:  a piece  of  pheasant  meat  is  cooked  between  two 
slices  of  veal,  which  are  then  discarded  (Ref.  1) . The  experi- 
mental determination  of  the  model  parameters  will  then  assist  in 
the  heat  budget  calculation. 

Most  of  the  radiative  transfer  programs  model  aerosols  as  a 
collection  of  spherical  (Mie)  scatterers,  with  a uniform  refrac- 
tive index  and  their  size  distribution  described  by  a function  n(r) . 
The  parameters  of  the  model  are  then  the  refractive  index  m (in 
general  complex)  and  parameters  in  the  size  distribution  function. 

In  reality,  atmospheric  aerosol  particles  are  not  all  spherical 
nor  do  they  have  a uniform  refractive  index.  It  is  often  argued 
that  aerosol  particles,  no  matter  what  their  shdpe;  'acquire  a water 
coating  and  become  spherical.  This  heuristic  argument,  reasonable 
though  it  may  be,  should  not  be  confused  as  a substitute  for  proof 
of  our  hypothesis.  Rigorously  speaking,  the  intent  of  the  approach 
adopted  here  is  to  describe  the  effects  of  atmospheric  aerosols  in 
terms  of  a simple  model.  There  is  no  prior  assuraince  that  it  is 
possible  to  incorporate  all  the  observed  aerosol  effects  in  such 
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a simple  model.  Thus,  initially,  the  experiments  must  be  used  to 
justify  the  model.  That  is  to  say,  not  only  must  the  model 
parameters  be  determined,  but  the  experimental  proof  of  the  self- 
consistency  of  the  method  must  be  displayed.  If  the  results  of 
the  simple  model  cannot  account  for  the  measurements  then,  and 
only  then,  is  the  consideration  of  more  complicated  aerosol  models 
justified. 

To  digress  for  a moment,  in  the  literature,  the  laboratory 
measurements  of  scattering  off  irregular  particles  floating  in  a 
nitrogen  jet  are  compared  with  the  scattering  patterns  of  spherical 
particles.  The  lack  of  agreement  is  then  used  as  a justification 
for  considering  horrendously  complicated  aerosol  models  that  can 
only  be  handled  by  statistical  methods.  But  it  will  be  found  that 
the  ground-based  measurements  of  the  scattered  radiation  field  in 
the  atmosphere  (that  have  been  obtained  at  UCLA  for  the  last  two 
years)  can,  indeed,  be  explained  in  terms  of  the  simple  model  des- 
cribed. Perhaps  it  is  safe  to  conclude  that  irregular  particles 
floating  in  a nitrogen  jet  do  not  simulate  the  behavior  of  the 
atmosphere . 

The  experiments  conducted  by  the  late  Professor  Z.  Sekera  at 
UCLA  established  that  the  presence  of  particulates  in  the  atmos- 
phere alters  the  polarization  characteristic  of  skylight.  How  the 
polarization  measurements  can  be  used  to  infer  the  aerosol  model 
parameters,  and  the  self-consistency  checks  that  have  been  devised 
to  verify  the  validity  of  the  assumptions  of  the  aerosol  model  are 
described. 

The  method  of  approach  is  to  construct  the  radiation  field  for 
all  possible  values  of  the  aerosol  parameters.  Admittedly,  this 
is  primitive,  and  the  viability  of  the  method  hinges  crucially  on 
the  table  being  complete  and  on  the  development  of  an  efficient 
algorithm  that  can  help  match  measurements  and  calculations. 
Redundancy  in  the  model  can  increase  the  size  of  the  tables  and 
frustrate  the  search  for  a fit. 
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II.  CONSTRUCTION  OF  THE  TABLES  AND  CHOICE  OF  A SIZE  DISTRIBUTION 

The  compilation  of  a complete  table  of  radiation  field  in  a 

turbid  atmosphere  necessarily  involves  the  identification  of  a 

model  that  is  of  sufficient  generality  that  it  can  accommodate  the 

commonly  occurring  radiation  field  patterns.  Consistent  with  our 

philosophy  of  selecting  the  simplest  such  model,  we  considered  the 

power  law  distribution  due  to  Junge,  n(r)  = a/r^  ^ a deceptively 

simple  functional  representation-  It  is  often  assumed  that  there 

are  only  two  parameters  in  this  representation,  a (proportional  to 

the  concentration)  and  v (that  determines  the  distribution  in 

sizes).  Actually,  however,  the  integrals  involving  n(r)  require 

a cut-off  at  r . and/or  r , and  the  results  can  be  sensitive  to 
mxn  max 

the  value  of  the  cut-off  point,  i.e.,  the  cut-off  point  becomes  a 
parameter  in  the  model.  This  defici-ency  of  the  power  law  distri- 
bution is  not  well  recognized  in  the  literature  and  so  it  would  be 
wJrth  while  to  illustrate  this  with  an  example.  The  optical  depth 
T in  our  model  is  given  by  t = tt  / r^n(r)Q^^^(x)  dr,  where 
is  the  van  de  Hulst  extinction  efficiency  factor.  Let  us  examine ^ 
the  integral  at  the  lower  limit  of  integration.  For  small  values 
of  r,  when  m is  complex,  from  van  de  Hulst *s  book  we  find  that 
Q 2^^(x)  X,  where  x = kr.  Thus,  the  integrand  in  the  expression 

V — 2 

for  T behaves  like  1/r  . When  v > 3,  a condition  that  occurs 

quite  often,  the  lower  limit  of  the  integral  will  behave  worse 

than  Zn  r . . Thus,  when  v > 3,  r . can  become  a significant 

min  min 

parameter  in  the  calculation  of  t . This  is  an  undesirable  and 
unphysical  restriction  on  this  model. 

The  modified  gamma  di^stribution  used  by  Deirmendjian  is  well 

behaved  at  both  limits  of  r and  his  analysis  of  commonly  occurring 

aerosols  led  him  to  classify  them  into  three  types — haze 

H(a„r^exp(-b„r) ) , haze  L(a  r^exp(-b  v^) ) andhaze  M(a  r exp(-b  >/r) ) , 

H H L L MM 

with  a , a , a , b , b , and  b being  assigned  fixed  values.  If 
H L M H L M 

we  accept  this  classification  of  Deirmendjian,  then  the  most  gen- 
eral aerosol  distribution  would  be  an  admixture  of  these  three 
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types  and  . we  would  be  led  to  a tri-modal  distribution  with  the 
modes  determined  by  the  values  of  the  various  b*s.  The  admixture 
ratio  would  have  to  be  height  dependent  and,  thus,  the  problem  will 
be  complicated  even  further.  This  goes  against  the  grain  of  our 
philosophy,  which  is  to  start  with  the  simplest  possible  model  at 
the  outset,  and  introduce  complications  only  when  the  simpler  models 
are  found  to  be  deficient  in  accounting  for  the  measurements. 

Using  analytical  approximations,  we  investigated  (Ref.  2)  the 
redundancy  of  the  description  of  aerosols  and  we  found  that  the 
gamma  (or  the  haze  H type)  distribution  n(r)  = ar^exp(-br),  with 
a and  b considered  as  arbitrary  parameters  could  simulate  the 
results  due  to  haze  M or  haze  L.  This  feature  was  found,  to  persist 
even  when  the  exact  (Mie)  theory  was  used.  That  is  to  say,  the 
haze  H type  distribution  could  produce  the  same  phase  matrix  ele- 
ment as  the  three  haze  distributions  used  by  Deirmend j ian . Thus, 
for  a start,  we  are  led  to  use  the  haze  H type  distribution,  with 
two  parameters,  a (proportional  to  concentration)  and  b (inversely 
related  to  the  modal  radius) , and  our  table  entries  would  corre- 
spond to  various  values  of  these  and  other  parameters  (such  as 
refractive  index,  wavelength,  ground  albedo,  etc.).  We  are  at 
liberty  to  expand  the  table  by  the  inclusion  of  other  independent 
models,  if  experiments  warrant  it. 

It  is  important  to  appreciate  the  implications  of  the  redun- 
dancy that  is  described  in  the  preceding  paragraph.  Many  different 
size  distributions  give  rise  to  the  same  phase  matrix  element  and, 
hence,  the  same  radiation  field.  Thus,  from  a measurement  of  the 
radiation  field,  we  will  not  be  led  to  a unique  size  distribution 
but  to  a class  of  distributions,  all  equivalent  to  one  another  in 
that  they  correspond  to  the  same  radiation  field  (and,  hence, 
fluxes) . This  observation  is  perfectly  in  consonance  with  our 
attempt  to  model  the  actual  atmospheric  aerosols  in  terns  of  an 
idealized  collection  of  spherical  scatterers,  and  what  we  are 
observing  is  that  there  is  still  another  degeneracy  allowed  in 
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these  models,  in  that  even  the  spherical  scatterers  can  have  dif- 
ferent size  distributions  and  yet  yield  the  same  phase  matrix  ele- 
ment. Needless  to  say,  we  will  expect  that  the  data  can  be  fitted 
by  other  distributions,  such  as  log-normal  or  multimodal,  but  in 
all  cases  the  fluxes  derived  must  be  the  same.  (This  statement  on 
fluxes  has  been  proved  only  for  the  Deirmendjian  distributions. 

It  is  anticipated  that  the  result  will  hold  for  other  distributions 
as  well.) 

Let  us  then  enumerate  the  assumptions  inherent  in  the  equiva- 
lent aerosol  model  and  in  the  calculation  of  the  tables  of  radia- 
tion field. 

1.  Aerosols  are  spherical  scatterers  and  so  Mie  theory  can  be 
used. 

2.  Aerosols  have  a uniform  refractive  index. 

3.  Size  distribution  of  aerosols  is  given  by  n(r)  = ar^exp(-br), 
where  the  concentration  (related  to  a)  varies  with  height - 

4.  The  atmosphere  is  assumed  plane  parallel  and  so  there  is  a 
horizontal  homogeneity  of  aerosols. 

5.  Multiple  scattering  code  (that  due  to  J-V.  Dave,  developed 
for  NASA)  is  used,  to  calculate  the  radiation  field. 

6.  The  ground  reflection  is  assumed  to  be  Lambertian,  an 
assumption  that  is  not  serious  in  the  study  of  the  downwelling 
radiation  but  quite  unacceptable  for  the  upwelling  radiation 
studies  (as  in  a satellite  borne  ^experiment) . We  will  comment  on 
this  later  in  the  paper. 

The  multiple  scattering  calculations  are  performed  iteratively. 
When  the  size  of  the  angular  integration  is  small, ^the  error 
decreases  but  the  cost  of  computation  increases.  Thus,  there  is 
an  optimal  angular  integration  step  that  is  used  in  the  calculation 
of  the  entries  of  the  table  and,  corresponding  to  this,  there  is 
an  error  in  the  calculated  radiation  field.  Any  fit  with  experi- 
ment need  be  only  consistent  within  these  error  bounds. 
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So  as  to  develop  the  algorithm,  we  studied  the  sensitivity  of 
the  radiation  field  to  the  variation  of  the  aerosol  model  param- 
eters (Refs.  3,  4).  For  our  analysis,  we  considered  a conical 
scan  of  the  sky,  with  a fixed  zenith  angle  of  observation  and 
variation  of  the  azimuth  angle  (f)(with  respect  to  the  Sun  plane) 
as  in  Fig.  1.  We  did  not  include  the  almucantar  sweep  since  our 
polarimeter  was  not  designed  to  view  the  direct  beam  of  the  Sun- 
In  this  type  of  a scan  the  intensity  curve  (I  vs.  (j>)  exhibits  a 
bell-shaped  structure,  and  it  was  found  that  the  half -width  of 
the  intensity  curve  was  determined  mainly  by  b.  [This  is  a phys- 
ically reasonable  condition  since  the  width  of  the  intensity  curve 
is  the  forward  peak  of  scattering  and  that  is  sensitive  to  the  size 
of  the  particles;  b is  a parameter  that  describes  (the  inverse  of) 
the  average  (modal)  radius  of  the  distribution.]  Thus,  our 
algorithm  consists  of  choosing  a reasonable  value  (but  fixed  for  a 
location)  for  the  ground  albedo,  then  determining  b by  examining 
the  width  of  the  curve  of  I against  (}>,  and  then  determining  t and 
m by  examining  the  curve  of  P against  <j),  and  the  tail  of  the  curve 
of  I against  (J).  This  is  actually  a graphical  algorithm  more  diffi- 
cult to  explain  than  to  use.  Numerical  simulations  were  used  to 
check  the  algorithm  that  are  more  difficult  to  explain  than  to  use. 
Niamerical  simulations  were  used  to  check  the  algorithm.  These  were 
found  satisfactory  and,  thus,  we  were  ready  to  examine  the  field 
of  measurements. 

III.  THE  FIELD  MEASUREMENT  PROGRAM  AT  UCLA 

A ground-based  polarimeter  was  used  to  measure  the  intensity 
and  polarization  of  skylight.  The  scan  of  the  sky  was  the  conical 
scan  described  in  Fig.  1.  The  measurements  were  made  at  2 wave- 
lengths and  at  several  (conical  scans)  zenith  angles  of  obser- 
vation. From  a measurement  of  the  I and  P in  one  conical  scan  (at 
various  azimuth  angles) , we  infer  the  aerosol  parameters  using  the 
algorithm  that  was  developed.  These  inferred  aerosol  parameters 
can  be  inserted  in  the  radiative  transfer  code  and  the  radiation 
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Fig*  i.  The  conical  scan  mode  of  the  ground-based  polarimeter 
(P)  * The  azimuth  (j)  is  defined  with  respect  to  the  Sun  vertical 
plane  (hatched) * 0 and  9^  are  the  observation  and  solar  zenith 
angles,  respectively . 

field  (I  and  P)  can  be  calculated  at  the  other  wavelength  and  the 
other  zenith  angles  of  observation.  If  these  calculations  compare 
favorably  with  the  measurements  at  the  other  wavelength  and  other 
zenith  angles  of  observation,  then  we  have  a consistency  check  on 
the  assumption  of  the  model.  If  they  do  not  match,  then  we  will 
have  to  analyze  the  assumptions  in  the  model  and  see  if  any  of  the 
assumptions  have  to  be  relaxed  to  fit  the  data.  For  instance,  if 
there  are  horizontal  inhomogeneities,  as  when  one  air  mass  is 
replacing  the  other,  we  would  find  some  disparity  in  the  multi- 
angular  scans. 

These  measurements  were  carried  out  at. various  locations  so  as 
to  determine  if  the  experimentally  inferred  effective  parameters 
had  any  meteorological  significance  (Ref.  5).  For  instance,  if 


the  aerosols  near  the  coastal  area  are  monitored,  there  would  be  a 
preponderance  of  marine  aerosol  particles,  and  the  refractive  index 
would  be  close  to  that  of  water.  So  also  the  desert  aerosol  must 
yield  indices  of  refraction  closer  to  that  of  silicates  (near  1.55) 
Santa  Ana  conditions  (dry  dusty  winds)  that  bring  clear  days  at 
Los  Angeles  should  have  particles  of  small  size  and  so  on. 

A large  number  of  measurements  were  made  over  these  chosen 
sites  at  various  times  of  the  year  with  the  idea  of  arriving  at 
average  values  of  these  effective  parameters  for  various  geo- 
graphical cind  meteorological  situations. 

IV.  RESULTS  OF  THE  GROUND-BASED  MEASUREMENT  PROGRAM 

Measurements,  indeed,  validated  the  assumptions  of  the  model. 
Figures  2 and  3 were  made  at  two  different  wavelengths,  one  after 
the  other,  and  we  find  that  the  parameters,  indeed,  are  the  same. 
Note  that  the  optical  depth  at  X =0.575  ym  is  greater  than  that  at 
X =0.7  ym,  as  it  ought  to  be.  Figures  3 and  4 show  the  multiple 
zenith  angle  correlation.  Again,  the  parameters  derived  are  the 
same.  We  have  found  this  type  of  a correlation  in  numerous  measure 
ments.  Further,  we  find  that  the  effective  parameters  do  have  a 
meteorological  significance,  the  refractive  index  approaching  1.34 
when  there  is  marine-type  aerosols  and  approaching  1.54  when  the 
dry  desert  aerosols  are  present.  The  extensive  measurements  that 
we  have  carried  out  also  confirms  the  fact  that  it  is  possible  to 
characterize  aerosols  occurring  during  certain  meteorological  con- 
ditions with  particular  average  values.  This  is  simunarized  in  the 
table.  It  is  now  possible  to  assign  representative  values  for 
these  aerosol  parameters  based  on  the  geographical  location  and 
use  these  in  the  heat  budget  calculations.  We  are,  of  course, 
extending  the  local  measurements  made  in  Los  Angeles  basin  on  a 
global  basis,  a procedure  that  is  reasonable  only  because  the  Los 
Angeles  basin  includes  within  it  representative  regions  of  the 
coastal,  urban  and  desert  air  masses. 
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Fig.  2.  May  22,  1976  at  UCLA.  Solar  zenith  angle  = 40^, 
observation  zenith  angle  = 6(P , wavelength  (X)  — 0*7  y.  Inferred 
aerosol  parameters  (assuming  ground  albedo  (A)  = 0.2)  are:  m = 1.5, 
b = 20,  T = 0.15. 


The  parameters  that  are  determined  have  the  following  error 
bounds:  6b  = 2,  6t  = 0.02,  6m  = 0.02-  It  is  possible  to  sharpen 

these  error  bounds  by  having  a more  detailed  catalog  of  tables. 

But  it  is  not  clear  that  this  is  needed  for  the  use  that  we  envision 
for  the  inferred  parameters.  In  some  cases,  we  have  found  that  by 
choosing  a Haze  M representation  we  will  be  able  ^to- obtain  better 
agreement  with  the  experimental  measurements,  i.e. , the  parameters 
can  be  determined  with  better  accuracy.  . In  all  cases,  however,  we 
were  able  to  fit  the  data  within  the  limits  of  the  errors  given  pre- 
viously. This,  of  course,  means  that  if  we  can  live  with  that -amount 
of  error  in  the  parameters,  then  our  model  is  quite  satisfactory. 
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Fig.  3.  May  22,  1976  at  UCLA.  Solar  zenith  angle  = 40^, 
observation  zenith  angle  - 50^ , wavelength  (\)  = 0.575  y.  Inferred 
aerosol  parameters  (assuming  ground  albedo  (A)  = 0.2)  are:  m = 1.5, 
b = 18,  X = 0.20. 


V.  STUDY  OF  UPWELLING  RADIATION — SATELLITE  BORNE  POLARIMETER 

The  radiative  transfer  equation  is  an  integral  equation. 
Therefore,  in  order  to  determine  the  radiation  field  in  any  one 
direction,  it  is  necessary  to  know  it  in  every  other  direction. 
Thus,  the  check  of  the  assiamptions  involved  in  the  model  has 
ensured  us  the  validity  of  the  calculations  of  the  upwelling 
radiation  field.  However,  the  experimental  check  of  this  is  com- 
plicated by  the  presence  of  ground  reflection.  The  model  that  we 
use  for  the  ground,  as  a Lambertian  source,  while  satisfactory  for 
the  study  of  the  downwelling  radiation  (since  the  ground  signal  is 
only  an  insignificant  part  of  the  downwelling  radiation) , is 
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Fig,  4,  May  22,  1976  at  UCLA,  Solar  zenith  angle  = 40^, 
observation  zenith  angle  = 60^ , wavelength  (X)  = 0,575  y.  Inferred 
aerosol  parameters  (assuming  ground  albedo  (A)  = 0,2)  are:  m = 1,5, 
2^  = 20,  T = 0,15, 


totally  unsatisfactory  for  the  study  of  upwelling  radiation.  In 
fact,  in  the  long  wavelength  region  (near  0.7  ym) , the  signal  from 
the  ground  completely  dominates  the  aerosol  and  the  molecular  part 
and  can  be  used  as  a monitor  of  the  ground  reflection  property. 
This  can  be  introduced  in  the  short  wavelength  radiative  transfer 
calculation  as  the  ground  source.  The  preliminary  calculations  we 
have  done  seem  to  justify  this  approach.  We  will,  however,  con- 
sider the  simpler  case  of  the  radiation  field  over  the  ocean,  off 
from  the  specular  region,  where  it  is  not  unreasonable  to  expect  a 
case  of  zero  ground  reflection.  In  actuality,  however,  the  specu- 
lar component  due  to  the  water  must  also  be  modeled  and  included 
with  some  gross  features  of  sea-surface  roughness. 
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As  a preliminary  effort  we  have  ignored  these  effects  and 
assumed  that  the  sea  surface  is  a near~zero  reflection  surface  and 
analyzed  the  upwelling  radiation  in  order  to  determine  the  algo- 
rithm for  inference  of  the  aerosol  parameters.  The  analysis  is 
made  more  difficult  here  by  the  fact  that  the  observation  geometry 
must  also  be  consistent  with  the  satellite  path.  We  have  con- 
sidered the  mode  of  observation  where  the  target  is  situated  on 
the  earth  in  the  satellite's  path,  and  the  polar imeter  will  lock 
itself  onto  the  target,  viewing  it  (in  the  satellites  orbit  plane) 
as  the  satellite  continues  in  its  orbit.  The  observation  angles 
will  be  restricted  such  that  the  specular  component  of  the  reflec- 
ted light  is  not  encountered  by  the  instrument.  The  rules  thajt 
have  been  developed  for  the  algorithm  are  quite  different,  but  we 
do  find  a marked  difference  in  the  intensity  and  the  polarization 
(Fig.  5)  patterns  as  the  aerosol  parameters  are  varied.  One  other 
difference  in  this  mode  of  analysis  is  that  we  find  it  is  neces- 
sary to  determine  the  actual  intensity  values  and  not  just  the 
normalized  values.  In  the  absence  of  measurements,  we  have  checked 
the  algorithm  that  has  been  developed  against  numerical  simulations. 
We  find  that  the  algorithm  is  quite  satisfactory.  Details  of  these 
analyses  are  expected  to  be  published  at  a later  date. 

VI . CONCLUDING  REMARKS 

Theoretical  studies  of  aerosol  radiative  effects  suggest  that 

o 

the  heating  and  cooling  due  to  aerosols  can  be  as  much  as  1 per 
day,  and  in  order  to  incorporate  their  effects  in  the  heat  budget 
of  the  atmosphere,  we  must  know  the  values  of  the^  parameters  of 
the  aerosol  model  (that  was  used  to  estimate  their  heating  and 
cooling  effect) . Thus,  local  measurements  can  yield  the  actual 
values  of  the  aerosol  parameters  but  they  are  not  the  values  that 
we  use  in  flux  calculations.  Radiative  calculations  involve  bulk 
properties  of  atmospheric  aerosols  and  it  is  these  that  must  be 
experimentally  determined.  It  is  possible  to  use  the  exact 
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calculations  to  derive  empirical  formuias  for  the  fluxes  in  terms 
of  the  aerosol  model  parameters  and,,  thus,  the  incorporation  of 
the  flux  calculation  in  the  circulation  model  will  be  rather  inex- 
pensive and  simple. 

Rather  than  consider  very  complicated  models  for  aerosols,  we 
decided  to  use  the  haze  H type  distribution.  In  the  literature, 
it  is  quite  usual  to  find  exotic  multi-modal  and  other  distributions, 
but  for  our  purposes,  to  obtain  a rough  estimate  of  all  the  param- 
eters, the  simpler  description  will  suffice.  There  was,  however, 
a need  to  consider  the  consistency  of  the  whole  method.  We  have 
devised  such  a scheme,  where  multiangular  and  multispectral 
measurements  are  used  to  verify  the  internal  consistency  of  the 
method.  The  large  mmiber  of  experiments  performed  at  UCLA  with  a 
ground-based  polarimeter  have  verified  these  hypotheses.  (The 
experiments  off  the  coast  of  Dakar,  as  part  of  the  GARP's^  Atlantic 
Tropical  Experiment  (GATE)  project,  yielded  aerosol  parameters  that 
seem  to  be  of  the  same  magnitude  as  those  determined  by  using  other 
methods  by  Quenzel,  Prospero  and  others.) 

After  settling  the  question  of  the  internal  consistency  of  the 
method,  our  next  task  was  to  infer  the  aerosol  parameters  of  rep- 
resentative air  masses.  Los  Angeles  basin  has  within  its  environs 
urban,  marine,  and  desert  type  (geographic)  locations.  It  seemed 
sensible  to  monitor  the  air  masses  in  these  regions  and  deduce 
some  average  values  of  the  relevant  parameters  so  as  to  obtain  cin 
aerosol  climatology  of  sorts.  This  measurement  program  yielded  the 
parameters  siimmarized  in  the  table.  We  also  found  that  typical 
meteorological  conditions  resulted  in  predictable  changes  in  the 
aerosol  parameters.  For  instance,  it  was  found  that  the  marine  air 
masses  invariably  resulted  in  lowering  of  the  refractive  index  to 
near  that  of  water  and  Santa  Ana  winds  yielded  smaller  particles 
with  higher  refractive  index . 

^Global  Atmospheric  Research  Program 
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The  imaginary  part  of  the  index  of  refraction  was  less  than 
0*02.  To  determine  this  better,  we  would  have  to  obtain  a 
catalog^  of  radiation  field  that  is  finer  than  what  we  have.  This 
is  not  a very  serious  limitation,  because  we  Ccoi  easily  state  the 
error  that  this  imprecision  in  the  imaginary  part  introduces  in  the 
flux  calculations.  There  is  a feeling,  in  certain  sections  of  the 
scientific  community,  that  the  imaginary  part  of  the  aerosol  refrac- 
tive index  is  the  sole  parameter  of  interest  to  weather  and  climate 
This  is  an  erroneous  statement,  for  a nonabsorbing  aerosol  (with 
no  imaginary  part)  can  reflect  the  solar  radiation  very  effectively 
lowering  the  available  solar  energy,  and  cause  a serious  modifi- 
cation of  the  heat  budget.  Perhaps  the  imaginary  part  is  of  con- 
sequence in  the  statements  of  local  heating  and  cooling  that  can 
be  of  importcince  in  the  smaller  scale  processes.  Unfortunately, 
the  type  of  measurements  we  propose,  where  the  bulk  properties  are 
m<;mitored,  is  probably  of  very  limited  value  for  these  studies. 
Thus,  the  imprecision  in  the  determination  of  the  imaginary  part 
of  the  index  of  refraction  is  not  a serious  drawback  for  this  type 
of  an  experiment. 

The  ground-based'  experiments  have  validated  the  theoretical 
scheme  that  parameterizes  the  aerosol  effects  in  the  atmosphere. 

The  radiative  transfer  equation  that  was  solved  to  determine  this 
scheme  was  an  integral  equation,  and  these  have  the  property  that 
in  order  to  obtain  the  value  of  the  radiation  field  in  the  downward 
direction  (this  is  what  is  monitored  using  the  ground-based 
polarimeter) , the  radiation  field  in  all  other  directions  must  be 
known.  That  is,  a solution  results  in  the  prediction  of  the  radi- 
ation field  in  all  other  directions.  Thus,  we  can  proceed  with  a 
great  deal  of  assurance  to  use  this  scheme  to  study  the  upwelling 
radiation  field,  a mode  of  measurement  that  would  be  used  by  a 
satellite-borne  polarimeter.  In  this  measurement  mode,  the  ground 
reflection  no  longer  plays  an  innocuous  role  and,  therefore,  the 
use  of  Lambert  reflection  for  the  ground  is  not  appropriate. 
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Our  sensitivity  studies  show  that  at  the  long  wavelength 
region  the  signal  received  by  a satellite  is  for  all  intents  cind 
purposes  the  ground  source.  Thus,  a long  wavelength  measurement 
of  the  intensity  and  polarization  would  be  approximately  a measure- 
ment  of  the  ground  reflection  intensity  and  polarization.  If  we 
then  make  the  assumption  that  these  characteristics  of  the  ground 
do  not  change  much  in  the  short  wavelength  region  (an  assimiption 
that  is  quite  reasonable  or  if  one  prefers  the  spectral  variation 
Ccin  be  modeled  on  the  basis  of  experimental  results  of  ground 
reflection  properties) , we  will  be  able  to  calculate  the  radiation 
field  in  the  short  wavelength  region  for  a realistic  ground  reflec- 
tion matrix.  Now,  if  an  algorithm  ccin  be  developed  for  this  type 
of  a ground,  then  we  will  be  able  to  infer  the  aerosol  parameters 
from  satellite  measurements.  This  is  the  general  scheme  that  we 
are  using  but  before  attempting  the  full  problem,  we  addressed 
ourselves  to  the  simpler  question,  "Can  algorithms  be  devised  to 
infer  aerosol  particle  characteristics  from  satellite  measurements, 
if  there  is  no  ground  reflection  problem?” 

Physically  speaking,  sea  surface,  if  it  is  smooth,  can  approxi 
mate  a zero  ground  reflection  surface,  if  the  specular  reflecting 
part  is  excluded.  Since  the  oceans  form  a large  part  of  the  scene 
viewed  from  a satellite,  this  is  also  a physically  relevant  study. 
Our  analysis  shows  that  aerosol  parameters  can,  indeed,  be  inferred 
even  though  the  algorithm  in  this  case  is  more  complicated-  Over 
80  sets  of  synthetic  data  were  analyzed  using  the  algorithm  and 
we  had  no  trouble  arriving  at  the  bulk  properties  of  the  aerosol 
parameters.  The  viewing  geometry  used  was  also  chosen  to  be  reason 
ably  close  to  that  of  a realistic  satellite  orbit.  It  is,  of 
course,  possible  to  extend  the  analysis  to  other  viewing  geometries 
so  as  to  arrive  at  the  optimum  measurement  configuration. 

The  case  of  zero  ground  reflection  assured  us  that  a polar i- 
meter  measurement  from  a satellite  over  the  oceans  can,  indeed, 
be  used  to  infer  the  aerosol  characteristics.  We  have  proceeded 
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to  consider  the  case  of  non-zero  ground  reflection.  This  will 
involve  a multi-spectral  analysis.  Preliminary  work  that  has  been 
completed  shows  that  when  a non-Lambert  ground  is  introduced  in 
the  short  wavelength  radiative  transfer  code,  the  algorithm  can 
still  be  used.  We  will,  during  the  course  of  the  year,  extend 
this  work  to  various  types  of  non-Lambertian  ground  surfaces  and 
carry  out  checks  on  the  algorithm. 
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constant  of  proportionality  in  Junge's  power  law 
distribution 

parameters  in  size  distribution  for  haze  H,  L,  and 
M,  respectively 
Lambert  ground  albedo 

parameters  in  size  distribution  for  haze  H,  L,  and 
M,  respectively 

specific  intensity  of  scattered  radiation  field 
= 27t/A 

refractive  index  of  scatterer 
size  distribution!  function 

degree  of  polarization  of  scattered  radiation  field 
extinction  efficiency  factor 
particle  radius 

minimum  and  maximum  value  of  r. 
observation  zenith  angle 
Sun  zenith  angle 
wavelength  of  light 

optical  depth  due  to  aerosol  particles 
observation  azimuth  angle 

power  exponent  on  r in  Junge's  power  law  distribution 
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DISCUSSIONS 

Chahine:  Have  you  compared  the  results  from  the  polarimeter  to 

those  from  the  multi-spectral  measurements? 

Kuriyan:  Yes.  About  two  or  three  years  ago.  Dr.  Chahine,  a stu- 

dent and  I wrote  a paper  on  multi-spectral  extinction  measurements. 
We  found  that  if  we  have  four  precise  measurements,  then  it  is 
possible  to  infer  the  effective  parameters-  But  unfortunately, 
the  precision  that  was  required  was  not  available  in  the  UCLA 
instrimient.  So  we  have  used  the  UCLA  instrxament  strictly  to 
measure  the  optical  depth  and  make  sure  that  at  least  one  parameter 
is  fixed-  What  I would  like  to  correct  is  the  feeling,  at  least  in 
certain  circles,  that  it  is  the  imaginary  part  of  the  refractive 
index  that  is  significant  for  weather  and  climate.  There  is 
nothing  farther  from  the  truth,  because  a very  highly  reflecting 
aerosol  (with  zero  imaginary  part)  can  change  the  albedo  suf- 
ficiently to  affect  weather  and  climate.  It  is,  therefore,  very 
important  to  determine  the  total  refractive  index — both  the  real 
and  imaginary  parts.  Our  method,  however,  will  not  allow  us  to 
determine  imaginary  parts  better  than  0.02. 

Barkstrom:  I think  on  the  downward  looking  experiment  one  ought 

to  use  considerable  care  regarding  the  model  of  the  lower  boundary 
condition.  It  is  possible  to  do  some  modeling  of  that  and  things 
seem  to  fall  into  two  categories.  But,  there  will  be  very  strong 
limb  brightening  for  a number  of  surfaces,  including  things  like 
ocean  and  vegetation. 

Kuriyan:  There  are  two  approaches  that  we  adopt.  One  is  to  look 

at  zero  ground  reflection  areas  (sea  surfaces)  and  to  do  v/hat 
Dr.  Quenzel  talked  about,  which  is  to  model  the  roughness  for  the 
sea  and  so  on.  But  the  other  approach  is  based  on  our  sensitivity 
studies.  Any  measurement  at  about  0.7  or  0.8  microns  seems  to  be 
entirely  due  to  the  signal  from  the  ground,  because  the  aerosol 
and  molecular  effects  are  so  small  and  what  we  get  is  really  a 
good  measurement  of  the  reflection  property  of  the  ground,  the 
polarization  of  the  ground  and  so  on.  Nor,  if  we  make  the  assxmip- 
tion  that  this  property  does  not  vary  when  you  go  to  the  short 
wavelength  region,  then  you  can  build  this  ground  reflection  prop- 
erty into  the  radiative  transfer  code.  So,  in  the  next  stage  of 
the  development  when  we  consider  land-based  targets,  we  include 
a reflection  matrix  which  is  measured  at  the  long  wavelength  fre- 
quencies . 

Barkstrom:  Yes,  if  you  want  to  model  that  with  a doubling  code 

you  may  want  to  adopt  a low  single  scattering  albedo  model  and  a 
high  single  scattering  albedo  model. 
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Kuriyan:  Yes,  I am  sure  we  will  have  to  consider  the  full  range. 

Remsberg:  Have  you  run  into  any  situations  where  your  catalog 

did  not  give  you  a proper  answer  or  an  answer? 

Kuriyan:  Some  of  the  data  that  we  gathered  did  not  correspond 

exactly  to  the  tables.  These  occurred  during  some  meteorological 
activity  in  the  area  (such  as  moving  air  masses)  and  so  it  is  not 
clear  if  this  is  an  evidence  of  the  deficiency  of  the  model.  In 
other  words,  during  stable  conditions,  the  data  that  we  gathered 
could  be  fit  with  the  Deirmendjian  H-type  distribution. 

Remsberg:  The  point  I was  getting  at  was:  suppose  you  had  distri- 

butions that  really  couldn’t  be  handled  by  the  modified 
Deirmendjian  case.  As  I understand  it,  your  catalog  is  really 
set  up  on  that  type  of  a distribution  where  you  vary  b as  your 
parameter.  If  you  had  some  multi-modal  distributions,  that  might 
not  hold- 

Kuriyan:  Well,  I didn't  discuss  this  point.  The  first  paper  that 

I wrote  on  the  subject  had  to  do  with  the  fact  that  Deirmendjian’ s 
haze  H,  L and  M were  equivalent  in  that  they  gave  rise  to  the  same 
phase  matrix.  I can  then  take  any  linear  combination  of  the  three 
models  and  produce  the  same  phase  matrix  and  I will  have  a tri- 
modal  distribution.  Thus,  I am  not  worried  about  multi-modal 
distributions.  They  don’t  interest  me,  because  I am  interested  in 
the  radiative  fluxes  and  both  types  of  distributions  yield  the 
same  fluxes. 
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EXPERIENCE  WITH  THE  INVERSION  OF  NIMBUS  4 


BUV  MEASUREMENTS  TO  RETRIEVE  THE 
OZONE  PROFILE 


Carlton  L.  Mateer 
Atmospheric  Environment  Service 


The  relative  merits  of  pressure  increment  and  partial 
derivative  formulations  of  the  ozone  inversion  problem  are 
discussed  briefly.  The  height  range  of  validity  of  the 
retrieved  ozone  profile  and  the  effects  of  adding  wavelengths 
to  or  of  dropping  wavelengths  from  the  inversion  system  are 
indicated.  Illustrative  results  are  presented  for  profiles 
retrieved  from  BUV  data  using  Backus-Gilbert/  minimum  infor- 
mation (Twomey) f and  quasi-optimum  procedures, 

I . INTRODUCTION 

The  Backscatter  Ultraviolet  (BUV)  experiment  on  Nimbus  4 
measures  nadir  direction  BUV  radiances  in  12  wavelength  channels 
from  0.2555  ym  to  0.3398  ym.  Four  long  wavelength  channels  (0.3125, 
0-3175,  0.3312,  0.3398  ym)  are  used  to  infer  total  atmospheric 
ozone  content,  and  the  eight  shorter  wavelength  channels  (0.2555, 
0.2735,  0.2830,  0.2876,  0.2922,  0.2975,  0.3019,  0.3058  ym)  are  used 
to  infer  the  ozone  profile  above  the  main  ozone  density  maximiam. 

In  this  paper,  we  are  concerned  only  with  the  latter  problem. 

The  normalized  contribution  or  weighting  functions  for  the 
experiment  are  illustrated  in  Fig.  1.  At  the  shortest  wavelengths, 
absorpti6n  is  so  strong  that  penetration  of  the  photons  into  the 
atmosphere  is  severely  impeded  and  a single  scattering  model  is 
very  accurate  for  calculating  the  backscattered  radiance. 
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CONTRIBUTION  FUNCTION 


Fig.  I.  Contributions  to  the  nadir  direction  radiance  by 
backscattering  at  various  levels  in  the  atmosphere,  d atm  = 
101.3  kPa.) 
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However,  as  we  progress  to  longer  wavelengths,  the  absorption 
coefficient  decreases,  and  the  photons  penetrate  more  deeply  into 
the  ozone  layer.  Eventually,  the  absorption  is  weak  enough  to 
ensure  that  a sufficient  number  of  photons  pass  through  the  ozone 
layer  and  multiple  scattering  in  the  troposphere  has  to  be  con- 
sidered for  the  accurate  calculation  of  the  backscattered  radiance 
(wavelengths  0.3058  ym  and  longer  in  Fig.  1)  . 

In  estimating  the  ozone  profile,  we  have  restricted  ourselves 
to  a consideration  of  those  wavelengths  which  are  relatively 
unaffected  by  multiple  scattering — namely,  those  for  which  at 
most  secondary  scattering  is  required  and  the  small  correction  to 
primary  scattering  may  be  parameterized.  There  are  two  reasons 
for  this  restriction.  First,  Yarger  (Ref.  1)  has  shown  that  there 
is  little  retrievable  information  about  the  ozone  profile  below 
the  ozone  density  maximum  in  measurements  at  those  wavelengths 
which  are  strongly  affected  by  multiple  scattering.  Second^  with 
many  ozone  profiles  to  be  retrieved,  the  necessary  multiple  scat- 
tering calculations  would  be  very  expensive.  As  we  shall  see, 
this  effectively  limits  the  validity  range  of  our  retrieved  pro- 
files to  levels  abdve  the  ozone  density  maximum. 


II.  FORMULATION  OF  THE  INVERSION  PROBLEM 

For  a single  scattering  model,  the  basic  integral  equation 
may  be  written  as 


Q(X,0^) 


•Po 

exp[-  (1  + secG  ) (a  X + g,p)]  dp 

^ O A p A 


(1) 


where  Q(X,0^)  is  the  BUV  measurement  at  wavelength  X and  solar 
zenith  angle  0^.  For  convenience,  Q incorporates  the  extrater- 
restrial solar  flux  as  well  as  certain  constants  aind  geometric 
factors; 
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is  the  ozone  absorption  coefficient;  X is  the  amount  of 

A p 

ozone  above  pressure  level  p;  3^  is  the  Rayleigh  scattering 
coefficient;  and  p^'  is  the  surface  pressure. 

For  the  four  shortest  wavelengths,  a X >>  3-.P-  If  we 

A p A 

set  k = a.  (1  + sec  0 ) and  use  the  exponential  ozone  profile 
/ o 

assumption  at  high  levels,  namely. 


X = 
P 


(2) 


where  C and  a are  the  constants  specifying  the  profile,  then  it 
is  easy  to  show  that 


Q(k)  = (kO'^^rCa  + 1)  (3) 

where  T (a  + 1)  is  the  gamma  function  for  parameter  a + 1.  In 
short,  a linear  regression  of  In  Q against  In  k for  the  four  short- 
est wavelengths  will  yield  an  estimate  of  the  profile  constants  C 
and  a.  We  shall  refer  to  this  profile  as  the  upper  first  guess. 
Next,  from  the  estimate  of  the  total  atmospheric  ozone  content  (see 
Ref.  2) , a lower  first  guess  is  constructed  from  average  balloon 
profiles.  The  lower  and  upper  first  guesses  are  objectively  com- 
bined to  provide  the  complete  first  guess  or  a priori  constraint 
profile  from  which  the  final  estimated  ozone  profile  is  obtained. 

We  have  used  two  convenient  formulations  for  this  final  impor- 
tant step.  The  first  of  these  is  the  partial  derivative  (P.D.), 
one  in  which 


dZn 
din.  K. 

X 


) 6£n  = 6£n 


2i 


(4) 


where  x^  is  the  amount  of  ozone  in  layer  i for  the  first  guess. 
Typical  derivatives  are  shown  as  the  dashed  curves  in  Fig.  2.  Use 
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of  the  logarithmic  derivative  has  the  advantage  that  the  peak 
values  for  each  wavelength  are  about  the  same  magnitude.  The 
second  formulation  is  the  pressure  increment  (P.I.)  one,  which  we 
have  used  in  a normalized  form,  as  follows,  from  the  quadrature 
form  of  Eq.  (1) : 


? t(pVQi)exp[-  (1  + sece^)  (a  S g x.  + 

± X A oX.  _A] 


3 = 1 


- Q. 


(5) 


where  the  asterisk  refers  to  the  observation  and  the  circumflex 
("hat")  symbol  refers  to  the  solution  profile,  p^  is  the  mean 
pressure  in  layer  i and  Ap^  is  the  pressure  change  across  the 
layer.  The  quantity  in  the  curled  braces  is  the  kernel  for  which 
typical  values  are  shown  by  the  continuous  curves  in  Pig.  2.  The 
normalization  (multiplication  of  each  exponential  by  p./Q^)  5ias 

r A 

the  effect  of  making  the  peaks  of  the  kernel  functions  approxi- 
mately the  same  magnitude. 

In  the  P.D.  method,  the  unknowns  are  the  logarithmic  ozone 
content  changes  for  each  layer.  Total  atmospheric  pressure  is 
conserved  exactly,  but  total  ozone  will  generally  differ  somewhat 
from  the  first  guess  value.  In  the  P.I.  method,  the  unknowns  are 
the  fractional  changes  in  the  scattering  mass  in  each  layer  and 
the  ozone  against  pressure  relationship  is  recovered  by  -integrating 
from  the  top  down.  Total  ozone  is  conserved  exactly,  but  total 
atmospheric  pressure  will  differ  somewhat  from  the  first  guess 
value.  These  are  trivial  considerations  for  profile  retrievals 
where  the  lower  limit  of  validity  of  the  final  profile  is  above 
the  ozone  density  maximum  and,  consequently,  well  above  the  Earth's 
surface. 
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PJ.WEIGmNG  FUNCTION 

Fig.  2.  P.T.  weighting  functions  (solid  curves);  partial  der. 
atives  (dashed  curves);  Backus-Gilbert  spread  (dotted  curve);  pro- 
file variance  reduction  by  qua si -optimum  (P^I,)  method  (small  circ. 
profile  variance  reduction  by  quasi-optimum  (P^D.)  method  (plus  si( 
Vertical  scales  at  right  show  location  of  maximum  for  all  waveleng} 
for  P.I.  and  P.D.  methods ^ respectively.  (1  bar  =>  100  kPa.) 


III.  INVERSION  PROCEDURES 


Results  from  three  inversion  procedures  are  presented  here. 
These  are  the  Backus-Gilbert/  the  minimum  information  (Twomey)  , cuid 
the  quasi-optimum  procedxires.  The  first  two  of  these  have  been 
amply  described  by  other  authors  at  this  Workshop.  The  quasi- 
optimum procedure/  as  used  herein,  differs  from  the  optimum  one  in 
that  the  covariance  matrices  for  the  constraint  profile  (first 
guess  profile)  and  for  the  observational  errors  are  purely  diagonal, 
having  nonzero  elements  only  on  their  diagonals.  The  Backus- 
Gilbert  method  is  used  here  primarily  in  a diagnostic  sense  to 
indicate  the  pressure  (height)  range  of  validity  of  the  derived 
profiles  and  to  illustrate  the  effects  of  adding  wavelengths  to  or 
of  dropping  wavelengths  from  the  inversion  system  as  dictated  by 
the  multiple  scattering  limitations  stated  earlier. 


IV.  RELATIVE  MERITS  OF  THE  PARTIAL  DERIVATIVE  AND  PRESSURE 

INCREMENT  FORMULATIONS 

If  there  are  relatively  few  data  sets  to  be  inverted  and/or 
computer  time  is  of  no  serious  consequence,  the  partial  derivative 
method  offers  the  advantage  of  simplicity  in  interpretation — it 
gives  the  change  in  ozone  in  each  layer  directly — and  we  are, 
after  all,  seeking  an  ozone  profile.  In  an  optimum  or  quasi- 
optimum procedure,  the  profile  covariance  matrix  may  be  interpreted 
directly  in  terms  of  profile  errors  at  different  levels.  However, 
unless  the  first  guess  or  constraint  profile  is  extremely  close  to 
the  final  solution,  the  P.D.  system  is  nonlinear — a minimum  of 
two  iterations  is  required  before  the  system  is  quasi-linear . If 
there  are  many  data  sets  to  be  inverted  and  computer  time  is  a con- 
sideration, the  computation  (and  re-computation  with  each  iteration) 
of  the  partial  derivatives  is  time  constaming. 
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On  the  other  hand,  the  pressure  increment  formulation  avoids 
this  time-consuming  partial  derivative  calculation  and  is  very 
nearly  linear  on  the  first  iteration  so  that  further  iterations 
are  not  required.  With  the  optimum  or  quasi-optimum  procedure, 
the  profile  covariance  matrix  can  probably  be  interpreted  directly 
in  terms  of  ozone  profile  errors  at  the  different  levels.  The 
P.I.  method  does  require  an  integration  to  recover  the  pressure- 
ozone  relationship,  followed  by  an  interpolation  back  to  the 
standard  layers  used  in  the  atmospheric  model,  a relatively  minor 
disadvantage. 


V.  PRESSURE  RANGE  OF  SOLUTION  VALIDITY 

Figures  3 and  4 contain  illustrative  results  for  the  Backus- 
Gilbert  method  using  the  P.D.  formulation.  The  solid  curves 
labeled  "SPREAD"  show  the  Backus-Gilbert  spread  (in  pressure 
scale-heights)  for  a constant  (profile)  error  of  5%  as  a function 
of  atmospheric  pressure.  The  measurement  error  was  assumed  to 
have  the  form 


a 


2 

A 


0.0001  + 


0.00005 


(6) 


where  is  the  measurement  error  of  the  backscattered  radiance 
(I^)  at  wavelength  A.  ' This  provides  for  a basic  measurement 
error  of  ± 1%  (1st  term)  plus  a noise  term  which  is  about  10%  for 
the  lowest  backscattered  radiances  measured  at  0.2555  ym.  Curves 
for  a constant  profile  error  of  10%  (for  a more  realistic  basic 
measurement  error  of  2%)  are  very  nearly  coincident  with  the 
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height,  km 


plotted  curves.  Figure  3 is  a high-latitude  northern  hemisphere 
case  for  December  1970.  The  "regular"  solution,  involving  small 
secondary  scattering  corrections  at  0-.3019  and  0.3058  ym,  would 
utilize  the  measurements  at  8 wavelengths  (N^  = 8) . The  solution 
profiles  are  the  dashed  curves.  The  solution  profile  for  = 6 
diverges  appreciably  from  the  N,  = 8 solution  profile  at  about 

3 mb  and  the  N = 7 solution  profile  diverges  appreciaibly  at  about 

A 

6 mb  (1  bar  = 100  kPa) . These  are  the  levels  at  which  the  spread 

begins  to  increase  rapidly  and  one  is  tempted  to  infer  that  these 

levels  represent  the  respective  lower  validity  limits  for  the  N = 

A 

6 and  = 7 solution  profiles.  If  this  is  correct,  then  the 
lower  validity  limit  for  the  8 wavelength  solution  profile  is  near 
10  mb,  and  the  upper  validity  limit  is  near  0.4  to  0.5  mb.  Figure 

4 is  a high-latitude  case  for  the  northern  hemisphere  in  May  1970, 

for  a solar  zenith  angle  of  61°,  The  "regular"  solution  would 
utilize  7 wavelengths  and  would  involve  small  secondary  scattering 
corrections  at  0.2975  and  0.3019  ym.  In  this  case,  the  picture  is 
not  so  clear.  The  = 4 and  = 5 solution  profiles  remain 

close  together  down  to  30  mb,  but  they  diverge  appreciably  from 
the  = 6 and  = 7 solution  profiles  below  about  7 mb,  which 

is  also  the  level  at  which  the  spread  curve  for  N.  = 5 begins  to 
increase  rapidly.  The  = 6 and  = 7 solution  profiles  are 
virtually  coincident  down  to  about  16  mb  and  thereafter  are  sepa- 
rated by  about  5%.  The  16  mb  level  is  also  the  point  where  the 
spread  curve  for  N = 6 begins  to  increase  rapidly.  It  appears 
that  the  lower  validity  limit  for  the  = 7 solution  profile 
should  be  about  25  to  30  mb.  In  this  case,  in  common  with  others, 
the  upper  validity  limit  is  not  so  clear-cut  as  in  Fig.  3,  but 
would  appear  to  be  at  about  0,5  mb. 

The  dotted  curve  in  Fig.  2 is  the  spread  for  = 8-  If  we 
wanted  to  estimate  the  upper  and  lower  validity  limits  of  derived 
profiles  without  calculating  the  spread,  it  appears  that  the 
upper  validity  limit  is  approximately  at  the  level  where  the 
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G. 2555  ym partial  derivative  is  a maximum,  and  the  lower  validity 
limit  is  at  the  level,  below  the  maximiom  of  the  partial  derivative 
for  the  longest  wavelength  used,  where  the  partial  derivative  is 
about  70%  of  the  maximum  value- 

Information  about  the  probable  range  of  solution  validity  may 
also  be  estimated  from  the  quasi-optimum  method-  This  has  been 
done  for  both  the  P.I.  and  P-D-  formulations  by  using  a constant 
value  0.4  for  the  variance  of  the  first  guess  or  constraint  profile 
at  all  levels  and  noting  the  variance  reduction  in  obtaining  the 
solution  profile-  Curves  of  this  variance  reduction  are  labeled 
6a^  (P.I.)  and  6o^  (P.D.)  on  Fig.  2.  These  curves  do  not  delineate 
validity  limits  as  well  as  the  Backus-Gilbert  spread.  They  suggest 
a possible  upward  extension  of  the  upper  limit  for  the  P.I.  formu- 
lations and  a possible  downward  extension  of  the  lower  limit  for 
the  P.D.  formulation,  as  compared  to  Backus-Gilbert.  In  all  cases, 
it  is  clear  that  the  lower  validity  limit  is  above  the  main  ozone 
density  maximiim,  which  is  generally  within  the  15  to  25  km  range, 
depending  on  latitude  and  season. 

VI.  SOME  ILLUSTRATIVE  RESULTS 

Figures  5 to  8 inclusive  show  final  solutions  derived  by 
various  methods  for  four  BUV  wavelength  scans-  In  each  case,  the 
first  guess,  or  constraint,  profile  is  a solid  curve;  and  the 
remaining  profiles  may  be  identified  as  follows.  Case  1 — Profile 
curve  of  short  dashes:  This  is  the  one-iteration  Backus-Gilbert 

solution  for  the  P.D.  formulation;  secondary  scattering  corrections 
were  not  applied;  profile  error  is  a constant  5%  or  10%  for  ± 1 or 
± 2%  basic  measurement  error,  respectively  (as  in  Eq.  (6)). 

Case  2 — Profile  curve  of  long  dashes:  This  is  the  quasi-optimum 

solution  for  the  P.D.  formulation;  constraint  profile  varicince  is 
a constant  0.4;  basic  measurement  error  is  2%^  Case  3 — Profile 
curve  of  square  symbols:  This  is  the  quasi-optimum  solution  for 

the  P.I.  formulation;  constraint  profile  variance  is  a constant 
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OZONE  MIXING  RATIO,  ,yg/g 

Fig.  5.  First-guess f assumed  first-guess  profile  variance 
and  various  solution  profiles.  See  text  for  explanation  of 
symbols.  (1  bar  = 100  kPa.) 
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Fig.  8.  First-guess  and  various  solution  profiles. 


text  for  explanation  of  symbols,  (1  bar  = 100  kPa;  1 atm 
101,3  kPa,) 


0.02;  basic  measurement  error  is  ± 2%;  for  this  combination.  The 
overall  constraint  is  about  the  same  as  in  Case  2.  Case  4 — 

Profile  curve  of  triangular  symbols:  This  is  the  quasi-optimum 

solution  for  the  P.D.  formulation;  constraint  profile  variance  is 
given  by  the  curve  so  identified  in  Fig.  5;  basic  measurement 
error  is  ± 2 % . This  is  the  strongest  constraint  of  the  five  cases 
presented  and  the  solution  profile,  therefore,  deviates  least 
overall  from  the  constraint  profile.  Case  5 — Profile  curve  of 
small  circles:  This  is  the  minimum  information  (Twomey)  solution 

forthe  P.I.  formulation;  constraint  profile  variance  is  the  identity 
matrix;  the  measurement  variance  is  invariant  with  wavelength  and 
is  0.02  times  the  trace  of  the  kernel  "cross-product"  matrix; 
secondary  scattering  corrections  were  not  applied. 

For  comparison,  the  average  45^N  rocket  profile  derived  by 
Krueger  and  Minzner  (Ref.  3)  is  shown  in  Fig.  7.  At  most  levels, 
we  find  solution  profiles  which  are  both  higher  and  lower  than  the 
mean  rocket  profile  for  the  four  cases  presented.  For  each  indi- 
vidual BUV  scan,  the  difference  between  the  highest  and  lowest 
solution  profile  at  any  level  does  not  exceed  about  20%.  Although 
total  ozone  is  nearly  the  same  for  the  BUV  scans  profiled  in 
Figs.  5 cind  7,  the  inferred  high-level  ozone  profiles  are  very 
different.  These  differences  illustrate  changes  that  appear  to 
occur  between  profiles  before  (Fig.  5)  and  after  (Fig.  7)  the 
well-known  stratospheric  warming  phenomenon  of  high-latitude 
winters - 

VII.  CONCLUDING  REMARKS 

From  the  illustrative  solutions  themselves,  there  appears  to 
be  little  to  choose  between  the  P.I.  and  P.D.  formulations  (com- 
pare curves  of  long  dashes  and  of  squares) . However,  if  one  has 
a million  or  more  BUV  scans  to  evaluate,  the  P.I.  f emulation 
should  lead  to  significcuit  savings  in  computer  time. 
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Since  the  solution  profiles  depend  significantly  on  the  con- 
straint or  first  guess  profile,  and  on  the  variances  assigned  to 
this  profile  and  to  the  instrijinent  ooservations,  considerable 
attention  should  be  paid  to  these  factors  in  establishing  a BUV 
evaluation  system.  The  merits  of  using  full  covariance  matrices 
as  opposed  to  the  diagonal,  or  variance  matrices  used  here,  is 
not  clear.  One  might  be  tempted  to  conclude  that  the  use  of  the 
complete  profile  covariance  matrix  could  lead  to  spin-off 
"correlation"  information  about  the  ozone  profile  below  the  main 
density  maximum.  I doubt  this  very  much;  more  such  information  is 
likely  available  from  total  ozone-profile  correlations  and  should 
be  incorporated  in  the  first  guess  profile.  In  any  event,  the 
information  necessary  to  establish  the  full  covariance  matrix  is 
not  yet  available.  A partial  covariance  matrix  can,  of  course,  be 
generated  from  the  BUV  data  themselves  by  operating  "along  the 
track"  or  on  a regional-seasonal  basis.  Such  an  approach  may  be 
adopted  for  the  evaluation  of  the  data  from  the  Scanning  BUV 
experiment  on  Nimbus  G. 


SYMBOLS 

C,a  constants  in  equations  for  exponential  ozone  profile 

I backscattered  radiance  at  wavelength  X 

A 

k a,  (1  + sec  0 ) 

A o 

number  of  wavelengths  used  in  an  inversion 
A 

p atmospheric  pressure 

p^  atmosphere  pressure  at  the  Earth’s  surface 

p^  average  atmospheric  pressure  in  layer  i 

Ap^  atmospheric  pressure  change  across  layer  i 

Ap.  estimate  of  Ap.  for  solution 

1 1 

P.D.  partial  derivative  inversion  formulation 

P.I.  pressure  increment  inversion  formulation 

Q quantity  defined  by  Eq.  (2) 

A 

Q*  observed  value  of  Q 

A 
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0 ) integral  function  of  wavelength  and  zenith  angle  which 
o 


r(a) 

6a^ 


0 

o 

A 


is  proportional  to  backscattered  radiance 
value  of  Q in  layer  i 

amount  of  ozone  in  lyaers  i and  j , respectively 
amount  of  ozone  above  pressure  level  p 
ozone  absorption  coefficient  at  wavelength 
molecular  scattering  coefficient  at  wavelength 
gamma  function  of  argument 

reduction  in  error  variance  for  an  ozone  profile 
achieved  in  an  inversion 
solar  zenith  angle 
wavelength 

amount  of  ozone  in  layer  i 
total  amount  of  ozone 
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DISCUSSION 


Cerni:  Were  any  rocket  flights  taken  simultaneously  as  the 

satellite  passed  overhead  which  could  serve  as  ground  truth 
measurements? 

Mateer:  We  do  not  have  very  many.  There  are  two  for  which  I have 
the  results,  although  I understand  that  there  are  more.  Here  is 
one  taken  on  February  24,  1971,  at  Barking  Sands , .Hawaii,  which 
is  not  too  bad.  (Shows  Fig.  D-1).  This  was  done  with  the  pressure 
increment  formulation  and  the  Twomey  minimum  information  method 
with  a rather  small  value  for  Y;  I believe  it  was  0.005  times 
the  trace,  which  is  a little  too  small. 
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MIXING  RATIO  ./igm/gm 

Fig.  D-1.  Mixing  ratio  comparison  for  Barking  Sands  data 
(1  bar  = 100  kPk)  .. 


TEMPERATURE  SENSING:  THE  DIRECT 


ROAD  TO  INFORMATION 


Lewis  D.  Kaplan 
The  University  of  Chicago 


The  retrievability  of  detailed  temperature  soundings 
from  remote  measurements  of  emission  spectra  depends  not  so 
much  on  how  the  data  are  treated  as  on  what  the  data  are.  It 
is  shown  that  the  shape  of  the  weighting  functions  depends 
on  the  nature  of  the  pressure  and  temperature  dependence  of 
the  transmittance,  which  differ  from  one  part  of  the  spectrum 
to  another  as  well  as  with  spectral  resolution. 

Included  are  results  of  a study  by  Kaplan,  Chahine, 
Susskind , and  Searl , which  show  that  careful  selection  of 
channels  can  result  in  much  narrower  weighting  functions  than 
those  corresponding  to  channels  that  have  actually  been  used. 
More  detailed  and  more  accurate  retrievals  can  be  obtained , 
therefore , if  the  instrumental  design  is  more  carefully  planned 
on  the  basis  of  sound  theoretical  principles . 


If  there  is  any  common  thread  of  this  Workshop,  it  is  the  ques- 
tion of  information  content.  In  particular,  much  of  the  discussion 
has  been  devoted  to  ways  of  obtaining  more  infoormation  out  of  the 
measurements  that  are  being  made.  Dr.  Rodgers,  for  example,  sug- 
gested introducing  other  a priori  information,  which  he  calls  "vir- 
tual measurements."  The  difficulty,  however,  is  that  we  are  using 
the  measurements  for  a purpose,  namely,  the  use  of  the  retrievals 
in  numerical  weather  prediction  and  the  fact  is  that  numerical 
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is  twice  as  narrow.  In  fact,  if  the  transmittance  is  given  by 


Tyj  = exp(-  Cjp"),  (6) 

the  weighting  function  has  the  form 

- dT.  /d£np  = nc  .p^xp  (-  c .p^)  (7) 

vj  3 3 

which  we  may  rewrite  as 

- dx^j/d£np  = np^exp(-  p^)  (7a) 

if  we  take  p in  units  of  p = 1/c . ; and  the  ratio  of  the 

max  ] 

weighting  function  at  p to  its  value  at  p is  given  by 

*^max  ^ ^ 

R = p^expd  - p^)  = f(p^)  = R(nilnp)  (8) 


Thus,  the  entire  shape  is  compressed  by  a factor  of  n on  a inp 
scale  relative  to  that  of  Eq.  (3) . 

It  turns  out  that  it  is  not  so  easy  to  look  directly  in  the 
wings  of  lines,  particularly  in  the  15  ym band  of  CO^/  which  is 
strewn  with  very  many  isotope  and  Fermi  resonance  bands.  In  fact, 
the  only  part  of  the  CO^  atmospheric  emission  spectrum  in  which 
the  line  wings  are  clean  is  the  R branch  of  the  4.3  ym  band.  This 
region  of  the  spectrum  has  the  added  advantage  of  strong  tempera- 
ture dependence  of  the  black  body  function,  which  further  narrows 
the  tropospheric  part  of  the  information  function  Bdx/dtop.  There 
is  a broadening  effect  in  the  stratospheric  part,  but  it  is  more 
than  compensated  for  by  the  strong  exponential  decay  in  a weighting 
function  of  the  form  of  Eq.  (5)  . 

This  is  illustrated  by  the  curves  of  Fig.  1,  taken  from  the 
aforementioned  Applied  Optics  paper.  The  left-hand  side  contains 
weighting  functions  fof  very  narrow  band  passes  centered  in  the 
wings  between  some  of  the  closely  spaced  lines  of  the  4.3  ym  band 
R-branch.  They  are  indeed  very  narrow  and  become  even  narrower 
in  the  troposphere  when  multiplied  by  the  black  body  intensity  B, 
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4.2  pm  weighting  functions  end  normelized  integrand 
cm  ^ . (X  bar  = 100  kPa.)  (From  Ref.  1.) 


as  shown  on  the  right-hand  side,  to  obtain  the  source  distribution 
of  the  emerging  photons.  In  the  stratosphere,  multiplication  by  B 
has  a broadening  effect,  however,  which  illustrates  the  point  made 
earlier  by  Dr.  Chahine  that  less  information  comes  from  the  regions 
of  minimum  temperature,  because  the  signal  is  mixed  with  effective 
noise  on  either  side. 

But,  in  the  troposphere,  both  sets  of  curves  are  very  narrow 
indeed,  and  we  have  many  more  channels  than  have  been  usable  for 
sounding  up  to  this  time.  The  ratio  of  the  pressures  at  the  half- 
peak points  of  the  weighting  functions,  which  should  be  about  3-1/2 
according  to  our  previous  discussion,  turns  out,  in  fact,  to  be 
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about  2-1/21  The  reason  for  this  is  that  there  is  a further 

o 

weighting  besides  the  p factor  in  the  wings  of  the  lines,  namely, 
that  due  to  the  Boltzmann  factor,  which  enters  linearly  into  the 
wing  absorption  coefficients.  We  had  assumed  in  arriving  at  Eqs. 

(4)  and  (5)  that  the  line  intensities  were  independent  of  tempera- 
ture. They  are  not,  however,  and  the  Boltzmann  factor  has  the 
effect  of  further  narrowing  the  weighting  functions  of  the  chan- 
nels near  the  high-J  lines,  which  peak  in  the  middle  and  lower 
troposphere.  The  effective  power  n of  p for  the  middle  troposphere 
channels,  which  appears  in  Eqs.  (6),  (7)  and  (8),  is  about  three. 

For  the  lowest  weighting  functions,  it  is  closer  to  two  since  the 
temperature  independent  induced  absorption  and  far  wings,  of  the 
strongest  CO^  lines  dominate. 

These  weighting  functions  were  calculated  at  Goddard  Institute 
for  Space  Studies  by  using  the  Susskind  line-by-line  transmission 
program  with  an  assumed  resolution  of  0.2  cm”^,  which  is  very  dif- 
ficult to  accomplish  in  practice.  We,  therefore,  decided  to  see 
how  much  narrowing  of  the  weighting  functions  would  result,  at  not 
so  high  a resolution,  from  the  temperature  effect  combined  with  the 
p effect  of  the  large  amount  of  background  emission  from  the 
collision  broadened  fundamental. 

Figure  2 shows  the  results  of  calculations  for  an  optimized 
set  of  2 cm”^  channels.  At  this  spectral  resolution,  each  channel 
includes  three  to  five  lines.  The  vertical  resolvability  of  the 
temperature  profile  is  definitely  not  as  good,  but  it  is  not 
really  bad  either.  The  ratios  of  the  pressures  at  half-peak  levels 
of  the  weighting  functions  are  generally  of  the  order  3 in  the 
troposphere,^  and  the  effect  of  the  black  body  factor  results  in  a 
restriction  of  the  information  content  to  a very  narrow  region 
for  each  of  the  tropospheric  channels. 

^The  sharp  decay  of  the  lower  parts  of  the  weighting  functions 
results  in  part  from  the  regular  spacing  of  the  lines,  which  maxi- 
mizes the  rate  of  decrease  of  transmittance  with  depth. 
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Fig.  2.  4.2  van  weighting  functions  and  normalized  integrand 

for  Av  = 2 cjn”^.  (1  bar  = 100  kPa.)  (From  Ref.  1.) 


Another  effect  of  the  B factor  at  this  decreased  spectral 
resolution,  however,  is  an  unacceptable  smearing  of  the  strato- 
spheric information  content  curves  on  the  right-hand  side  of  the 
figure.  This  can  be  seen  to  be  the  effect  of  the  B factor  by  com- 
parison with  Fig.  3,  which  presents  similar  curves  for  0.5  cm  ^ 
channels  in  the  15-ym  band  of  CO^ • The  15-ym  channels  are  defi- 
nitely better  for  sounding  the  stratosphere. 

An  obvious  solution  to  the  sounding  problem  is  to  graft 
together,  like  Dr.  van  de  Hulst*s  apples,  the  4.2-ym  channels  for 
the  troposphere,  where  they  give  better  vertical  resolution,  with 
the  15-ym  channels  for  the  stratosphere,  where  B-smearing  is 
minimal.  It  is  not  too  difficult  to  make  an  instrument  like  this 
and  Dr.  Chahine  is,  in  fact,  starting  a design  study  for  such  a 
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sounder.  Figure  4 contains  curves  of  Bdx/dilnp  for  such  a grafted 
sounder  on  the  right.  It  is  compared  with  similar  curves  on  the 
left  for  the  channels  of  the  NIMBUS  6 High  Resolution  Infrared 
Radiation  Sounder  (HIRS)  that  Dr.  Susskind  believes  to  give  non- 
redundant  information,  and  which  he  uses  for  his  retrievals. 

This  comparison  shows  the  improvement  in  discretization  of 
the  information  content  of  sampled  emission  spectra  that  can  be 
obtained  by  careful  selection  of  channels.  It  illustrates  that 
proper  planning  of  the  measurements  themselves  may  resolve  some  of 
the  problems  that  we  have  devoted  so  much  time  to  during  this 
Workshop. 
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DISCUSSION 


Pepin:  I would  like  to  ask  about  the  atmospheric  contaminants 

that  are  in  the  four  micron  region.  Have  you  looked  at  the  effects 
of  some  of  the  contaminants? 

Kaplan:  In  this  part  of  the  spectrum  there  aren’t  any.  There  are 

a few  very  weak  water  lines  all  the  way  down  on  the  tails  of  the 
bands.  There  may  be  a few  nitrous  oxide  lines  but  they  are  very 
weak  transitions;  they  would  be  among  the  isotope  bands.  The  main 
contaminant  is  actually  a help.  It  is  the  pressure  broadened 
nitrogen  fundamental.  It  isn’t  even  contaminated  by  carbon  dioxide 
lines  in  this  region.  The  extra  lines — the  hot  bands  and  the  iso- 
tope bands — are  almost  all  in  the  P-branch. 

Rodgers:  I know  it  is  very  attractive  to  think  that  the  number  of 

photons  is  proportional  to  the  amount  of  information  you’ve  got, 
but  it  isn’t  true.  It’s  the  rate  of  change  of  the  number  of  photons 
with  respect  to  the  thing  you  are  interested  in. 

Kaplan:  Yes,  okay. 

Rodgers:  So  you  should  have  dB/dT  in  there. 

Kaplan:  No,  you  shouldn’t  have  dB/dT  in  there.  What  usually  is 

done  is  to  write  (dB/dT) (dT/d£np)  for  what  you  really  want,  which 
is  (d/dT) /B  (dr/dilnp)  d£np.  And  that  is  all  right  as  long  as  the 
transmission  is  independent  of  temperature.  What  we  are  talking 
about  here  are  very  strong  temperature  dependent  transmissions. 

So  one  should  actually  do  the  derivatives.  Dr.  Susskind  has  done 
the  derivatives  and  they  turn  out  to  be  very  much  the  same  as  what 
I have  shown. 

Rodgers:  And  the  other  point  is  that  there  is  no  reason  why  you 
shouldn’t  use  selective  absorption  to  improve  the  shape  of  the 
weighting  functions.  This  is  almost  as  good  as  going  to  very  high 
resolution. 

Kaplan:  Well,  one  of  the  problems  with  selective  absorption  is 

that  when  you  are  making  measurements  over  a broad  band  you  are 
taking  into  account  not  only  the  wings  of  the  strong  lines  that 
you  are  after,  but  the  central  portions  of  weaker  lines  that  fall 
within  this  region.  If  you  compare  the  SCR  (Selective  Chopper 
Radiometer)  weighting  functions  with  the  weighting  functions  that 
we  have  gotten,  you  will  see  this  effect  coming  in  because  the  SCR 
doesn’t  really  get  what  one  would  expect  from  the  p^  effect.  I 
attribute  it,  at  least  in  part,  to  the  weak  lines. 
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Rodgers:  Have  you  tried  computing  SCR-type  weighting  functions 

for  your  case  to,  say,  five  wave  number  intervals? 

Kaplan:  Yes,  we  did. 

Wark:  There  happens  to  be  another  little  factor  here  called 

aerosols  and  these  are  the  main  contaminants  that  you  have.  There 
are  clouds-  And  for  one  thing  one  must  first  solve  for  the  amount 
or  the  effects  of  clouds,  because  these  weighting  functions  are 
grossly  distorted  by  the  influence  of  clouds.  FurthernK^re , in 
this  particular  spectral  region  those  very  clouds  also  reflect  sun- 
light very  severely.  Would  you  please  comment  on  these  two  things? 

Kaplan:  Yes,  I am  very  glad  you  brought  up  the  subject.  First  of 

all,  I really  promised  in  the  original  abstract  to  talk  about  the 
combination  of  the  bands  for  the  various  things.  One  of  the 
advantages  of  making  measurements  using  an  array  of  detectors  as 
was  used  on  the  SIRS  instrument,  as  you  know,  is  that  one  can  make 
measurements  in  different  orders.  It  turns  out  that  if  you  look 
at  the  4 micrometer  band  in  fourth  order,  you  are  looking  at  a 15 
micrometer  band  in  first  order,  and  that  is  why  I have  a half  a 
wave  niamber  rather  than  two  wave  numbers  for  that  15  ym  band.  You 
are  looking  in  the  window  region  in  second  order  and  6 micrometer 
water  band  in  third  order.  Probably  the  most  effective  way,  I 
think,  of  trying  to  eliminate  the  effects  of  clouds — we  usually 
have  partial  cloud  cover — is  to  make  temperature  sounding  measure- 
ments, .not  necessarily  complete  soundings,  throughout  the  whole 
region  in  both  the  4 micrometer  and  the  15  ym  band  as  has  been 
outlined  by  Dr.  Chahine-  Also  to  take  advantage  of  the  changes  of 
the  black  body  function  and  their  differences  in  the  two  bands,  and 
it  gives  us  much  more  precise  information  about  the  clouds.  And 
getting  this  high  resolution  in  the  4 micrometer  band  means  that 
you  will  be  able  to  specify  it  even  more-  I think  that  being  able 
to  sound  in  the  presence  of  clouds  is  to  us,  in  a way,  the  most 
important  reason  for  trying  to  do  the  analysis  in  this  direction. 
With  regard  to  the  scattering,  it  is  a problem.  But  it  is  pos- 
sible again  to  make  measurements  of  sunlight  outside  of  the  band. 
This  is  being  planned.  But  the  other  thing  that  should  be  remem- 
bered is  that  when  you  see  the  scattered  sunlight  it  is  going 
through  two  traversals.  You  are  usually  looking  straight  down, 
so  you  usually  have  an  air  mass  that  is  somewhat  larger  than  two — 
relative  to  the  vertical  soundings.  And  this  does  help  a great 
deal  to  eliminate  the  effect;  however,  it  probably  still  remains 
there,  particularly  for  high  clouds-  It  is  necessary  to  look 
at  reflected  radiation  at  several  wavelengths  to  try  to  at  least 
get  the  outline  of  the  sunlight  and  hope  that  when  you  extrapolate 
it  into  the  R-branch  of  the  four  micron  band  that  it  keeps  the 
same  properties. 
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Barkstrom:  It  seems  to  me  that  there  is,  perhaps,  a general 

question  that  you  have  raised  which  is  related  to  the  question  of 
how  the  measurements  are  to  be  used.  The  question'  you  seem  to  be 
asking  is,  what  does  the  meteorological  community  need  in  terms  of 
accuracy  on  temperature  structure  and  what  is  the  resolution  that 
is  needed  in  order  to  derive  a model?  And  it  seems  that  some  of 
the  things  Dr.  Rodgers  mentioned  earlier  regarding  the  uncertainty 
in  a statistical  sense  could,  perhaps,  be  tied  into  the  things  that 
you  are  saying  here.  And  then  the  question  is  raised,  what  do  we 
need  to  know  from  the  meteorological  community  in  order  to  do  the 
job?  Let  me  make  this  a little  more  quantitative.  If  I remember 
Dr.  Rodgers*  presentation,  it  seems  that  there  is  a relation 
between  the  uncertainty  in  the  measurement  and  the  error  in  the 
satellite  measurements.  The  question  would  be  whether  we  could 
get  estimates  of  the  uncertainty  that  is  needed  from  the  meteo- 
rological community  which  could  then  be  inverted  to  estimates  of 
the  error  and  the  resolution  needed  in  the  measurements? 

Kaplan:  Yes,  first  of  all  I am  a meteorologist  and  that's  why  I 

am  doing  this.  I feel  that  I need  this  data.  The  uncertainties 
themselves  really -are  related  to  the  resolution.  For  example,  in 
the  discussion  of  Chahine's  relaxation  method — when  it  is  used  in 
connection  with  the  HIRS  data  or  other  data,  it  is  an  a priori 
method  in  principle.  But  it  turns  out  not  to  be  an  a priori  method 
in  practice  when  the  levels  you  are  sounding  are  widely  removed 
because  you  do  have  to  make  assumptions  about  how  the  atmosphere 
is  behaving  all  along  the  way.  The  atmospheric  temperature  does 
not.  change  very  much  with  height  and  if  you  can  make  a measurement, 
say,  every  kilometer,  then  it  would  be  very  difficult  to  obtain  a 
temperature  distribution  which,  in  connection  with  numerical 
weather  prediction,  can  turn  out  to  be  very  much  different  from 
what  you  want.  It  also  turns  out,  contrary  to  what  the  dynamic 
meteorologists  had  believed,  that  in  certain  parts  of  the  atmo- 
sphere, particularly  in  the  tropics,  small  changes  in  static  sta- 
bility of  the  upper  troposphere  makes  a difference  in  the  tropical 
circulation.  And  these  small  changes  are  particularly  what  we  are 
looking  for.  If  you  constrain  by  introducing  statistics,  you  are 
throwing  out  information,  you  are  keeping  yourself  from  gaining 
information  that,  is  the  most  interesting  part  of  the  meteorological 
information  you  want.  Also,  if  we  are  looking  for  climatic  change, 
and  we  feel  that  there  has  been  a change,  due  to  the  introduction 
of  aerosols,  and  the  changes  in  ozone  distribution  and  photo- 
chemistry, we  cannot  use  past  data  in  order  to  interpret.  And  if 
we  try  to  make  models  based  upon  temperature  soundings  in  order 
to  use  the  statistics  we  would  not  have  been  able  to  reproduce, 
for  example,  the  last  month  and  a half's  soundings  because  the 
temperature  distribution  during  that  period  would  not  have  ever 
really  existed  in  the  90-year  history  of  radiosonde  soundings.  It 
was  at  least  94  years  since  we  had  weather  conditions  east  of  the 
Rockies  as  we  have  now.  These  anomolous  conditions  are  what  we 
are  mostly  after. 
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OPEN  DISCUSSIONS  II 


van  de  Hulst:  We  have  gradually  entered  into  the  open  discussion 

and  I think  Dr-  Barkstrom  put  the  very  appropriate  question  for 
this  entire  conference.  I have  some  comment  on  my  own  on  that, 
but  they  are  not  very  expert.  Does  somebody  else  want  to  comment? 

Susskind:  I want  to  make  a comment  that  is  not  very  expert  at  all. 
It  is  just  a feeling  that  I have  about  real  measurements  and  virtual 
measurements  and  something  like  this.  I never  thought  about  virtual 
measurements  before,  but  what  we  are  looking  for  in  soundings.  . . 
The  point  I want  to  make  is  I feel  there  is  certainly  information 
content  in  the  observations,  the  real  measurements,  but  the  infor- 
mation content  is  not  sufficient  to  give  you  everything  you  are 
looking  for.  As  a matter  of  fact,  if  you  are  looking  for  a detail 
temperature  profile, it  is  not  sufficient  to  give  that  at  all.  And 
to  do  this  you  have  to  make  some  kind  of  assumptions,  but  to  start 
saying  that  assumptions  that  are  based  on  statistics  or  some  feeling 
of  what  you  think  is  happening  is  a real  measurement,  I think  that 
one  might  lose  sight  that  you  were  not  really  measuring  something. 
Let  us  say  you  think  there  is  a tropopause  at  100  millibars  and  you 
force  this  into  your  solution  when  you  have  not  got  it  in  your 
vertical  resolution,  you  might  come  out  feeling  convinced  that  I 
have  found  the  tropopause  at  100  millibars  when,  in  fact,  that  maybe 
it  is  not  there.  And  I think  you  have  to  bear  in  mind  limitations 
that  this  really  is  not  there  and  you  should  not  convey  the  impres- 
sion to  someone  else,  say,  who  actually  does  not  know  exactly  what 
you  are  doing,  that  you  are  actually  determining  something  that 
really  is  not  there.  It  is  just  a feeling  I have  about  this.  I 
would  like  someone  to  make  a comment  about  whether  they  think  they 
are  really  determining  something  that  is  there  or  that  maybe  is 
likely  to  be  there  or  what? 

Strand:  With  regard  to  these  virtual  measurements,  you  put  them 

in  with  the  correct  expression  of  how  much  faith  you  have  in  them. 

If  you  put  them  in  with  grossly  incorrect — for  instance,  there  is 
something  that  deserves  a large  covariance  matrix  and  you  put  in 
a small  one,  then  you  can  expect  trouble.  But  normally  on  these 
things  we  put  in  covariances  that  are  commensurate  with  the  degree 
of  our  knowledge  of  whatever  we  put  in  for  a virtual  measurement. 

I am  inclined  to  agree  with  Dr.  Rodgers  that  these  are  every  bit 
as  valid  as  measurements  made  with  some  kind  of  machinery.  Just 
because  some  kind  of  machine  measures  it  does  not  mean  it  is  any 
more  or  less  sacred  than  something  that  you  put  in  from  your  sta- 
tistical knowledge. 

Rodgers:  Well,  I was  going  to  say  that  I agree  with  Dr.  Susskind. 

It  is  exactly  the  sort  of  thing  I meant  when  I said  you  have  got 
to  make  quite  sure  any  virtual  measurements  you  put  in  are  right. 


613 


Chahine : 


How? 


Rodgers:  That  is  entirely  up  to  you. 

Staelin:  Let  me  attempt  to  format  the  question  a litle  differently. 

I would  say  that  one  of  the  biggest  unsolved  problems  in  the  sta- 
tistical approach,  which  I think  is  the  proper  approach,  is  how 
best  to  select  the  a priori  statistics.  If  the  atmospheric  sta- 
tistics were  stationary,  then  it  would  be  an  easy  matter.  An 
appropriate  set  of  statistics  would  simply  involve  a sufficiently 
large  ensemble  to  make  the  errors  arbitrarily  small.  The  problem 
arises  because  the  atmospheric  statistics  are  nonstationary  in 
space  and  in  time.  The  means  and  standard  deviations  are  both 
very  slowly  varying  functions.  It  seems  incumbent  on  people  who 
wish  to  exploit  these  techniques  further  to  devise  procedures  other 
than  the  ad  hoc  ones  that  we  are  presently  using.  For  example, 
for  the  microwave  data,  we  now  simply  pick  an  a priori  ensemble 
that  is  overly  large.  We  use  the  entire  globe.  We  would  prefer 
to  take  small  regions  but  we  once  erred  by  overspecializing  the 
statistics-  We  broke  the  globe  into  seven  climatic  regions  and 
different  seasons.  Although  we  incorporated  enough  years  to  get 
adequate  data,  even  then  the  statistics  were  too  narrow;  they  were 
overoonstrained  and  introduced  too- much  noise.  Now  somewhere 
between  those  two  extremes — being  overly  general  or  too  restrictive — 
there  must  be  some  optimum  region  of  operation.  If  one  could  close 
the  definition  of  the  problem,  one  ought  to  be  able  to  arrive  in 
principle  at  what  is  more  nearly  an  optimxom  solution. 

King:  I think  we  must  be  careful  not  to  subvert  the  language.  I 

think  it  is  incorrect  to  speak  of  using  virtual  measurements  as 
putting  information  into  the  inversion.  I think  that  one  should 
say  instead  that  virtual  measurements  put  constraint  into  the 
inversion.  The  only  information  you  have  is  the  radiances. 

Chahine:  What  I want  to  say  is  the  following:  virtual  data  are 

not  and  cannot  be  a substitute  for  real  physical  measurements. 

We  have  to  do  our  best  to  get  the  most  accurate  measurements  for 
the  best  information  content  to  solve,  the  problem.  If  the  data 
are  incomplete, then  we  have  to  do  something  to  improve  the  solution. 
But  first  we  have  to  look  for  the  best  available  measurements, 
best  physics,  best  real  information. 

Twomey:  I would  like  to  make  a point.  The  distinction  is  not 

always  as  clear  as  some  of  the  statements  just  made  might  make  it 
appear.  I think  the  tropopause  is  a good  example.  We  have  prob- 
lems, obviously,  getting  by  any  straightforward  inversion  tech- 
nique; we  have  trouble  getting  the  tropopause  because  we  have  to 
put  in  a smoothing  operation  somewhere.  Nevertheless,  we  know 
almost  always  or  very  often  that  there  is  a tropopause  there.  In 
principle,  one  could  say  let  us  solve  the  whole  sequence  of 
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inversion  problems  in  which  we  define  the  height  and  temperature 
of  the  tropopause  and  then  get  the  best  solution  we  can,  some  con- 
strained solution.  This  would  take  a long  time,  I am  just 
pointing  out  that  it  is  possible  in  principle.  Now  whether  this 
procedure  is  reasonable  or  not  depends  on  the  error  of  the  measure- 
ments and  how  small  we  can  make  our  residuals  for  each  combination. 

In  other  words  we  would  have  to  do  a search  through  all  possible 
tropopause  levels  and  temperatures,  but  this  would  remove  the  con- 
flict between  the  smoothing  constraint,  which  we  require  for  sta- 
bility, and  the  existence  of  the  tropopause. 

van  de  Hulst:  I came  to  this  meeting  with  a very  firm  prejudice 

and  the  prejudice  is  that  instead  of  inversion  one  should  always 
do  modeling.  The  reason  why  I thought  so  is  that  the  inversion 
problem  is  similar  to  buying  a suit.  You  go  to  a store  and,  at 
least  in  Europe,  they  sometimes  have  different  floors.  On  one 
floor  is  a tailor.  He  takes  about  40  different  measurements. 

From  those  measurements  finally  after  a week  or  two,  he  comes  up 
with  a suit  that  does  not  quite  fit.  And  so  he  takes  another 
couple  of  measurements  and  then  finally  it  fits  perfectly;  it  is 
also  expensive.  The  alternative  is  that  you  address  someone  on 
the  other  floor  and  say:  my  size  is  54,  and  they  look  at  your  belly 
to  see  if  it  is  extra  big  or  not  and  those  two  parameters  together 
more  or  less  define  where  on  the  rack  you  will  find  something  which 
suits  you.  Now,  if,  in  fact,  you  try  to  pursue  this  analogy,  in 
which  both  the  number  of  parameters  and  the  cost  is  somewhat  repre- 
sentative, then  you  find  that  the  trade--off  is  not  as  simple  as 
you  might  think.  Both  methods  are  wasteful.  If  you  go  along  the 
rack,  which  has  been  called  a catalog  or  a library  in  these  dif- 
ferent talks,  then  there  is  waste  because  extra  material  is  used 
to  manufacture  also  suits  that  nobody  will  eventually  buy.  The 
other  method  is  wasteful  because  you  take  a certain  number  of 
measurements  which  are  redundant.  Finally,  the  customer  orientation, 
i.e.,  the  needs  of  the  public  and  the  responsible  bodies  in  govern- 
ment who  have  to  put  up  the  money  will  determine  where  the  actual 
trade-off  is.  I think  in  that  respect  this  particular  meeting  has 
been  quite  helpful  in  helping  all  of  us  to  get  a feel  of  where  the 
optimum  point  in  this  trade--off  might  be. 


615 


PARTICIPANTS 


J*  Abele 

Forschungsinstitut  fur  Optik 
D74  Tubingen  1 
Schloss  Kressbach 
West  Germany 

Richard  R.  Adams 
MS  234 

NASA-Langley  Research  Center 
Hampton,  VA  23665 

Bruce  R-  Barkstrom 
George  Washington  University 
MS  169,  NASA-Langley  Research 
Center 

Hampton,  VA  23665 
Jacob  Becher 

Dept,  of  Physics  & Geophysical 
Sciences,  Old  Dominion  Univ. 

P.  O,  Box  6173 
Norfolk,  VA  23508 

Sherwin  M.  Beck 
MS  401A 

NASA-Langley  Research  Center 
Hampton,  VA  23665 

Michael  Box 

Old  Dominion  University 
MS  234,  NASA-Langley  Research 
Center 

Hampton,  VA  23665 

Kurt  Bullrich 
Institut  fiir  Meteorologie 
Johannes  Gutenberg  University 
D-6500  Mainz,  West  Germany 

Todd  A.  Cerni 

University  of  Wyoming 

P-  O-  Box  3095,  Univ.  Station 

Laramie,  WY  82071 

Moustafa  T-  Chahine 
JPL,  Calif-  Inst-  of  Technology 
4800  Oak  Grove  Drive,  MS  183-301 
Pasadena,  CA  91103 

William  P-  Chu 
MS  475 

NASA-Langley  Research  Center 
Hampton,  VA  23665 


Mark  Clayson 

Dept,  of  Astro-Geophysics 
University  of  Colorado 
Boulder,  CO  80303 

Barney  Conrath 
Code  622 

Goddard  Space  Flight  Center 
Greenbelt,  MD  20771 

Adarsh  Deepak 

Old  Dominion  University,  and 
Institute  for  Atmospheric  Optics 
and  Remote  Sensing  (IFAORS) 

MS  234,  NASA-Langley  Research 
Center 

Hampton,  VA  23665 

Diran  Deirmendjian 
The  Rand  Corporation 
1700  Main  Street 
Santa  Monica,  CA  90406 

John  J.  DeLuisi 
NOAA/ERL-ARL-GMCC 
RF  3292 

Boulder,  CO  80302 

S.  Roland  Drayson 
University  of  Michigan 
Dept,  of  Atmospheric  & Oceanic 
Science 

Ann  Arbor,  MI  48109 

Henry  Fleming 
NOAA,  MS  B (NESS) 

FB  #4,  Room  0262 
Washington,  DC  20233 

Alexander  H.  Fluellen 
Atlanta  University  Center 
Box  320 — Clark  College 
Atlanta,  GA  30314 

Robert  S-  Fraser 
NASA,  Code  911 
Goddard  Space  Flight  Center 
Greenbelt,  MD  20771 

Alain  L.  Fymat 

JPL,  Calif.  Inst,  of  Technology 
4800  Oak  Grove  Drive,  MS  183B-365 
Pasadena,  CA  91103 


617 


J.  c. 

National  Center  for  Atmospheric 
Research 
P-  O.  Box  3000 
Boulder , CO  80307 

Aaron  Goldman 
Department  of  Physics 
University  of  Denver 
Denver,  CO  80210 

Richard  B.  Gomez 
Atmospheric  Science  Lab 
DRSEL-BL-MS 

White  Sands  Missile  Range 
New  Mexico  88002 

Robert  Goulard 

The  George  Washington  University 
Washington,  DC  20052 

Alex  E.  S.  Green 
Dept,  of  Physics  & Astronomy 
University  of  Florida 
Gainesville,  FL  32601 

F.  S.  Harris,  Jr. 

Old  Dominion  University 
MS  475,  NASA-Langley  Research 
Center 

Hampton,  VA  23665 

Benjamin  M.  Herman 
University  of  Arizona 
Inst,  of  Atmospheric  Physics 
Tucson,  AZ  85721 

Arlon  J.  Hunt 

Lawrence  Berkeley  Laboratory 
Building  50-205C 
Berkeley,  CA  94720 

William  M.  Irvine 
University  of  Massachusetts 
Tower  B,  Astronomy 
Amherst,  MA  01002 

Wolfgang  Jessen 
Forschungsinstitut  fur  Optik 
D-7400  Tflbingen 
Schloss  Kressbach 
West  Germany 


Lewis  D,  Kaplan 
University  of  Chicago 
Department  of  Geophysical  Sciences 
Chicago,  IL  60637 

Jean  I.  F.  King 
OPI 

Air  Force  Geophysics  Lab 
Hanscom  AFB,  MA  01731 

Jacob  G.  Kuriyan 
UCLA,  Dept,  of  Meteorology 
405  Hilgard  Avenue 
Los  Angeles,  CA  90024 

J.  Donald  Lawrence,  Jr. 

MS  401A 

NASA-rLangley  Research  Center 
Hampton,  VA  23665 

Jacqueline  Lenoble 
Laboratoire  d’Optique  Atmos. 

Univ.  de  Lille  I,  B.P,  36 
59650  Villeneuve  d*Ascq. 
t'rance 

Julius  London 
Dept,  of  Astro*-Geophysics 
University  of  Colorado 
Boulder,  CO  80303 

Leonard  R.  McMaster 
MS  234 

NASA-Langley  Research  Center 
Hampton,  VA  23665 

M.  Patrick  McCormick 
MS  475 

NASA-Langley  Research  Center 
Hampton,  VA  23665 

Michael  McElroy 
Pierce  Hall 
Harvard  University 
Cambridge,  MA  02138 

Harvey  Mai chow 
C.  S,  Draper  Lab, 

555  Technology  Square 
Cambridge,  MA  02139 

Carlton  Mateer 

Atmospheric  Environment  Serv. 

4905  Dufferin  Street 

Downsview  Ontario  M3H  5T4,  Canada 


618 


Jae  H.  Park 

College  of  William  & Mary 
MS  401A/  NASA-Langley  Research 
Center 

Hampton,  VA  23665 

Edward  M.  Patterson 
National  Center  for  Atmospheric 
Research 
P.  O.  Box  3000 
Boulder,  CO  80307 

William  A,  Pearce 
EG&G,  Washington  Analytical 
Service  Center 
6801  Kenilworth  Avenue 
Riverdale,  MD  20804 

Theodore  J.  Pepin 
University  of  Wyoming 
Dept,  of  Physics  & Astronomy 
P.  O:  Box  3095,  Univ.  Station 
Laramie,  WY  82071 

Walter  G.  Planet 
NOAA,  NESS,  MS  B 
FOB  #4 

Suit land,  MD  20233 

Heinrich  Quenzel 
Meteorologisches  Inst,  der 
UniversitSt  Milnchen 
8 Milnchen  2 .Theresienstrasse  37 
West  Germany 

John  A.  Reagan 

Eng.  Bldg,  #20 

University  of  Arizona 

Dept,  of  Electrical  Engineering 

Tucson,  AZ  85721 

Ruth  A.  Reck 

Physics  Department,  Tech.  Center 
General  Motors  Research  Labs 
Warren,  MI  48090 

Ellis  Remsberg 
MS  4 01 A 

NASA-Langley  Research  Center, 
AESD 

Hampton,  VA  23665 

Clive  D.  Rodgers 
Oxford  University,  Clarendon 
Lab  0X1,  3 PU,  England 


James  M,  Russell  III 
MS  401A 

NASA-Langley  Research  Center 
Hampton,  VA  23665 

Philip  B.  Russell 
Stanford  Research  Institute 
333  Ravenswood  Avenue 
Menlo  Park,  CA  94025 


Walter  L.  Snow 
MS  23 5A 

NASA-Langley  Research  Center 
Hampton , VA  23665 

David  H,  S tael in 

Massachusetts  Inst,  of  Technology 
Room  26-341 
Cambridge,  MA  02139 

Larry  B*  Stotts 

Naval  Elect.  Lab.  Cntr,  C-2500 
271  Catalina  Boulevard 
San  Diego,  CA  92152 

Otto  N.  Strand 
NOAA/Wave  Propagation  Lab 
Environmental  Res.  Labs,  R45x7 
Boulder,  CO  80302 

Joel  Susskind 

Goddard  Institute  for  Space  Studies 

2880  Broadway 

New  York,  NY  10025 

Thomas  J.  Swissler 
Systems  & Applied  Sciences 
MS  475,  NASA-Langley  Research 
Center 

Hampton,  VA  23665 

Didier  Tanre 

Labor ato ire  d'Optique 

Atmospherique,  D.U.S.V.A, , 

B.P.  36,  Villeneuve  D'Ascq. 

59650  France 

Morris  Tepper 
Code  ERD 

NASA  Headquarters 
Washington,  DC  20546 


Eric  P.  Shettle 
AF  Geophysics  Laboratory 
L.  G.  Hanscom  AFB 
Bedford,  MA  01731 


619 


Robert  W.  L.  Thomas 
EG&G  Washington  Analytical 
Service  Center 
6801  Kenilworth  Avenue 
Riverdale,  MD  20840 

Robert  E-  Turner 
Environmental  Research 
Institute  of  Michigan 
P.  O.  Box  618 
Ann  Arbor,  MI  48107 

Jerold  T . Twitty 
Old  Dominion  University 
MS  494,  NASA-Langley  Research 
Center 

Hampton,  VA  23665 

Sean  A.  Twomey 
University  of  Arizona 
Institute  of  Atmospheric  Physics 
Tucson,  AZ  85721 

Hendrik  C.  van  de  Hulst 
Leiden  University,  Huygens 
Laboratory 
78  Wassenaarse  Weg 
Leiden  240  Netherlands 

David  Wark 
NOAA/NESS 
Fed.  Bldg,  #4 
Washington , DC  20233 

Roger  L.  Weichel 
Lawrence  Livermore  Lab-142 
P.  O-  Box  808 
University  of  California 
Livermore,  CA  94550 

James  A.  Weinman 
University  of  Wisconsin 
1225  W.  Dayton  Street 
Madison,  WI  53706 

Ed  R.  Westwater 
NOAA/Wave  Propagation  Lab. 

R45x4 

Boulder,  CO  80302 


Cynthia  K.  Whitney 
C , S , Draper  Laboratory 
555  Technology  Square 
Cambridge,  MA  02139 

Harold  W.  Yates 
National  Envron.  Satellite 
Service,  NOAA 
Suitland,  MD  20233 


620 


NASA-Langley,  1977 


